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ENDOGENOUS AND NON-ENDOGENOUS VERSIONS OF 
HUMAN G PROTEIN-COUPLED RECEPTORS 

FIELD OP THE INVENTION 

5 

The invention disclosed in this patent document relates to transmembrane 
receptors, and more particularly to human G protein-coupled receptors, and 
specifically lo endogenous human GPCRs with particular emphasis "on non- 
endogenous versions of the GPCRs that have been altered to establish or enhance 
10 constitutive activity of the receptor. Preferably, the altered GPCRs are used for the 
direct identification of candidate compounds as receptor agonists, inverse agonists or 
partial agonists having potential applicability as therapeutic agents. 

1 5 BACKGROUND OF THE INVENTION 

Although a number of receptor classes exist in humans, by far the most abundant 
and therapeutically relevant is represented by the G protein-coupled receptor (GPCR or 
GPCRs) class. It is estimated mat there am some 100,000 genes within the human 

20 genome, and of these, approximately 2%, or 2,000 genes, are estimated to code for 
GPCRs, Receptors, including GPCRs, for which the endogenous ligand has been 
identified are referred to as "known" receptors, while receptors for which the 
endogenous ligand has not been identified are referred to as "orphan*' receptor*. GPCRs 
represent an important area for the development of pharmaceutical products: from 

25 approximately 20 of the 100 known GPCRs, approximately 60% of all prescription 
pharmaceuticals have been developed. 

GPCRs share a common structural motif AO these receptors have seven 
sequences of between 22 to 24 hydrophobic amino acids (hat form seven alpha helices, 
each of which spans the membrane (each span is identified by number, Le, 

30 transmembrane-! (TM-1), transrr»ebrane-2 (TM-2), etc.). The transmembrane helices 
are joined by strands of amino acids between transrnembrane-2 a 
SUBSTITUTE SHEET {RULE 26) 
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transmembrane -4 and transmembrane- 5 , and transmembrane-* and transmembrane*-? on 
the exterior, or -extracellular" side, of the cell membrane (these are referred to as 
1, 2 and 3 (EC-1, EC-2 and ECO), respectively). The 
helices are also joined by strands of amino acids between 
5 transmembrane-] and transmembrane- 2, transmembrane- 3 and tran sm em bra ne^, and 
mmsmcmbrane-5 and transraembrane-6 on the interior, or "intracellular" side, of the cell 
membrane (these are referred to as "intracellular" regions 1, 2 and 3 (IC-1, 1C-2 and IC- 
3), respectively). The "carboxy" ("C") terminus of the receptor lies in the intracellular 
space within the cell, and the -amino" (**N") terminus of the receptor lies in the 
10 extracellular space outside of the cell. 

Generally, when an endogenous ligand binds with the receptor (often referred to 
as "activation" of the receptor), there is a change in the conformation of the intracellular 
region that allows for coupling between the intracellular region and an intracellular **G- 
protem." It has been reported that GPCRs are "promiscuous" with respect to G proteins, 
15 it, that a GPCR can interact with more than one G protein. See, Kenakin, T, 43 Life 
Sciences 1095 (1988). Although other G proteins exist, currently, Gq, Gs, Gi, Gfc and Go 
are G proteins that have been identified. Endogenous figand-activated GPCR conpling 
with the G-protein begins a signaling cascade process (referred to as "signal 
transduction**). Under normal conditions, signal transduction ultimately results in 
20 cellular activation or cellular inhibition. It is thought that the IC-3 loop as well as the 
carboxy terminus of the receptor interact with the G protein. 

Under physiofogiea] conditions, GPCRs exist in the cell membrane in 
equilibrium between two different conformations: an "inactive" state and an "active- 
state. A receptor in an inactive state is unable to link to the intracellular signaling 
25 transduction pathway to produce a biological response. Chang ing the receptor 



BEST AVAILABLE COPY 



WO 01/3*471 



he active stats allows linkage to the tnmsduction pathway (via (he G- 
proteiD) and pro du ces a biological response. 

A receptor may be stabilized in an active state by an endogenous figand or a 
compound such as a drug Recent discoveries, including but not exclusively limited to 
5 rnodifjcatkms to the amino acid sequence of the receptor, provide means other than 

r 

endogenous ligands or drugs to promote and stabili7T, the receptor in the active stale 
conformation. These means effectively stabilize the receptor in an active state by 
simulating the effect of an endogenous ligand binding to the receptor. Stabilization by 
such Hgand^ndependent means is termed "constitutive receptor activation." 
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SUMMARY OF THE INVENTION 

nn i 

GPCRs and uses thereof 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 provides an illustration of second messenger IP] production from 
endogenous version RUP12 ("RUP12") as compared with the control ("CMV). 

Figure 2 is a graphic representation of the results of a second messenger cell- 
based cyclic AMP assay providing comparative results for constitutive signaling of 
endogenous RUP13 CRUP13") and a control vector ("CMV"). 

Figure 3 is a diagrammatic rep re senta tion of the signal measured comparing 
CMV, endogenous RUP13 ("RUP13 wt") and non-endogenous, constitunvery activated 
RUP13 ("RUP1 3(A268K) M ). utilizing 8XCRE-Luc reporter plasmid- 
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■are 4 is a graphic icpicscitatian of the results of a ("SJGTPyS assay 
comparative results for constitutive signaling by RUP13:Gs Fusion Protein 
C*RJJP13-Gs") "od • control vector ("CMV). 

Figure 5 is a diagrammatic repres en tatio n of the signal measured comparing 
CMV, endogenous RTJP14 ("RUP14 wT) and Doo-cndogenous, constitutivcly activated 
RUP13 CHUPl 4CL246X) M X utilizing 8XCRE-Luc reporter ptasmid. 

Figure 6 is a diagrammatic r epresen tation of the signal measured comparing 
CMV, endogenous RUP15 ("RUP15 wt") and non-endogenous, constitutivcry activated 
RUP15 ("RUP1 5(A398Kr)» utilizing 8XCRB-Luc reporter plasmid. 

Figure 7 is a graphic representation of the results of a second me sse n g er cell- 
based cyclic AMP assay providing co mpa r a t i ve results for constitutive signaling of 
endogenous RUP15 ("RUPI5 wt"), non-endogenous, cortstrrutivcly activated version of 
RUP15 ("RUP15<A398K)**) antfa control vector CCMV**). 

Figure 8 is a graphic r epre senta tion of the results of a ["SJGTFyS assay 
providing comparative results for constitutive signaling by RUPlSrGs Fusion Protein 
("RUP1 5-Gs") and a control vector ("CMV**). 

Figure 9 provides an illustration of second messenger IP) production from 
endogenous version RUP 17 ("RUP 1 T) as compared with the control CCMV). 

: 1 0 provides an illustration of second messenger IPj production from 
3 version RUP21 ("RUP21") as compared with the control ("CMV). 
Figure 1 1 ts a diagrammatic representation of the signal me a sur ed comparing 
CMV, endogenous RUP23 (**RUP23 wf) and non-endogenc 
RTJP23 rRUP23{W275K)-X utilizing 8XCRE-Luc reporter plasmid 
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Figure 12 is a graphic representation of results from a primary screen of several 
candidate compounds against RUP 13; results for "Compound A" are provided in well 
A2 and "Compound "B" are provided in well G9. 

5 DETAILED DESCRIPTION 

The scientific literature that has evolved around receptors has adopted a number 
of terms to refer to ligands having various effects on receptors. For clarity and 
consistency, the following definitions will be used throughout this patent document To 
the extent that these definitions conflict with other definitions for these terms, the 
1 0 following definitions shall control: 

AGONISTS shall mean materials (tg., b gauds, candidate compounds) that 
activate the intracellular response when they bind to the receptor, or enhance GTP 



PHENYl^OJVNIKB 


PHE 


F 


PROLINE 


PRO 


P 


SERINE 


SER 


S 


THREONINE 


THR 


T 


TRYPTOPHAN 


TRP 


W 


TYROSINE 


TYR 


Y 


VALINE 


VAX. 


V 



AMINO ACID ABBREVIATIONS used herein are set out in Table A: 
TABLE A 
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PARTIAL- AGONISTS shall mean materials (eg., 
compounds) that activate the intracellular response when they bind to (he receptor to a 
lesser degree/extent man do agonists, or enhance GTP binding to membranes to a lesser 
5 degree/extent than do agonists. 

ANTAGONIST shall mean materials (eg., ligands, candidate compounds) that 
competitively bind to die recep to r at the same site as the agonists but which do not 
activate the intracellular response initiated by the active form of the receptor, and can 

10 ANTAGONISTS do not dmunist 
agonist or partial agonist 

CANDIDATE COMPOUND shall mean a molecule (for e 
limitation, a chemical compound) that is amenable to a screening technique. Preferably, 
the phrase "candidate compound" does not include compounds which were publicly 

15 known to be compounds selected from (be group consisting of inverse agonist, agonist or 
antagonist to a receptor, as previously determined by an indirect id e ntific ation process 
("indirectly identified compound*); more preferably, not including an indirectly 
identified compound which has previously been determined to have therapeutic efficacy 



20 compound which has previously been determined to have therapeutic utility in humans. 
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COMPOSITION means a material comprising 



**pha 



" is an example of a composition. 



COMPOUND EFFICACY shall mean a measurement of the ability of a 
compound to inhibit or stimulate receptor functionality, as opposed to receptor binding 
affinity. Exemplary means of detecting compound efficacy are disclosed in the Example 

CO DON shall mean a grouping of three nucleotides (or equivalents to 
nucleotides) which generally comprise a nucleoside (adenosine (A), guanosine (G% 
cytidino (C), uridine (U) and thymidine (T)) coupled to a phosphate group and which. 



CONSTTTUTTVELY ACTIVATED RECEPTOR shall mean 

■ receptor activation. A constitimvcly activated receptor 



CONSTTTUTTVE RECEPTOR ACTIVATION shall mean stabilization of a 
15 receptor in the active state by means other than binding of. the receptor with its 
endogenous ligand or a chemical equivalent thereof 

CONTACT or CONTACTING shall mean bringing at least two moieties 
together, whether in an in vitro system or an in vivo 

DIRECTLY IDENTIFYING or DIRECTLY IDENTIFIED, in relationship to 
20 the phrase "candidate compound", shall mean the screening of a candidate compound 
against a constituti vely activated receptor, preferably a constitutively activated orphan 
receptor, and most preferably against a constitutively activated G protein-coupled cell 
surface orphan receptor, and assessing the compound efficacy of such compound. This 
phrase is, under no ciicumstances, to be interpreted or understood to be encompassed by 
25 or to encompass the phrase "indirectly identifying" or "indirectly identified. " 
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ENDOGENOUS shall mean a material that a mammal naturally produces. 
ENDOGENOUS in reference to, for example and not formation, the term "receptor - 
shall mean that which is naturally produced by a mammal (for example, and not 
limitation, a human) or a virus. By contrast, (he term NON-ENDOGENOUS in this 
5 context shall mean mat which is not naturally produced by a mammal (for example, and 
not limitation, a human) or a virus. For example, and not limitation, a receptor which is 
not constitutively active in its endogenous form, but when manipulated becomes 
c onst i tut ively active, is most preferably referred to herein as a "non-endogenous, 
constitutively activated receptor." Both terms can be utilized to describe both "in vivo" 
10 and "in vitro" systems. For example, and not limitation, in a screening approach, the 
endogenous or non-endogenous receptor may be in reference to an in vitro screening 
system. As a further example and not limitation, where the genome of a mammal has 



15 



screening of a candidate compound by means of an in vivo system is viable. 

G PROTEIN COUPLED RECEPTOR FUSION PROTEIN and GPCR 
FUSION PROTEIN, in the context of the invention disclosed herein, each mean a non- 
endogenous protein comprising an endogenous, constitutively activate GPCR or a non- 
endogenous, constitutively activated GPCR fused to at least one G ] 
preferably me alpha (a) subunit of such G protein (this being i 
20 GTPX with the G protein preferably being of the same type as the G protein that 
naturally couples with endogenous orphan GPCR. For example, and not limitation, in an 
endogenous state, if the G protein "Gsa" is the predominate G protein that couples with 
the GPCR, a GPCR Fusion Protein based upon the specific GPCR would be a non- 
endogenous protein comprising the GPCR fused to Gso; in some circumstances, as will 
25 be set forth below, a ncm-predonrinant G protein can be fitted to the GPCR The G 
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protein can be fused directly to the c-tcrminus of the constitutively active GPCR or there 
may be spacers between the two. 

HOST CELL shall mean a cell capable of having a Plasmid and/or Vector 
incorporated therein. In the case of a prokaryotic Host Cell, a Plasmid is typically 
5 replicated as a autorwmous molecule as the Host Cell replicates (generally, the Plasmid 
is thereafter isolated for introduction into a eukaryotic Host Cell); in (he case of a 
eukaryotic Host Cell, a Plasmid is integrated into the cellular DNA of the Host Cell such 
that when the eukaryotic Host Cell replicates, (he Plasmid replicates. Preferably, for the 
purposes of the invention disclosed herein, the Host Cell is eukaryotic, more preferably, 
10 mammahan, and roost preferably selected from the group consisting of 293. 293T and 
COS-7 cells. 

INDIRECTLY IDENTIFYING or INDIRECTLY IDENTIFIED means the 
traditional approach to the drug discovery process involving identification of an 
endogenous ligand specific for an endogenous receptor, screening of candidate 
15 compounds against the receptor for determination of those winch interfere and/or 
compete with the ligand-receptor interaction, and assessing the efficacy of the compound 
for affecting at least one second messenger pathway associated with the activated 
receptor. 

INHIBIT or INHIBITING, in relationship to the term "response" shall mean 
20 that a response is decreased or prevented in the presence of a compound as opposed to in 
the absence of the compound. 

INVERSE AGONISTS shall mean materials ligand, candidate compound) 
which bind to either the endogenous form of (be receptor or to the constitutively 
activated form of the receptor, and which inhibit the baseline intracellular response 
25 initiated by the active form of the receptor below the norma] base level of activity which 
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is observed in the absence of agonists or. partial agonists, or decrease GTP binding to 
membranes. Preferably, the baseline intracellular response is inhibited in the presence of 
the inverse agonist by at least 30%, more preferably by at least 50%, and most preferably 
by at least 75%, as compared with the baseline response in the absence of the inverse 
5 agonist 

KNOWN RECEPTOR shall mean an endogenous receptor for which the 
endogenous ligand specific for that receptor has been identified. 

LIGAND shall mean an endogenous, naturally occurring molecule specific for 
an endogenous, naturally occurring receptor. 
10 MUTANT or MUTATION in reference to an endogenous receptor's nucleic 

acid and/or amino acid sequence shall mean a specified change or changes to such 
endogenous sequences such that a mutated form of an endogenous, non-constitutivery 
activated receptor evidences constitutive activation of the receptor. In terms of 
equivalents to specific sequences, a subsequent mutated form of a human receptor is 
15 considered to be equivalent to a first mutation of the human receptor if (a) (he level of 
constitutive activation of the subsequent mutated form of a human receptor is 
substantially (he same as that evidenced by the first mutation of the receptor; and (b) the 
) percent sequence (amino acid and/or nucleic acid) homology between the subsequent 

mutated form of the receptor and the first mutation of the receptor is at least about 80%, 
20 more prefcrabry at least about 90% and most preferably at least 95%. Ideally, and owing 
• to the fact that the most preferred cassettes disclosed herein for achieving constitutive 
activation includes a single amino acid and/or codon change between the endogenous 
and the iion-endogenous forms of the GPCR, the percent sequence homology should be 
at least 98% 

10 
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NON-ORPHAN RECEPTOR shaD mean an end 
molecule specific for an endogenous naturally occurring ligand wherein the binding of a 
bgand to a receptor activates an mtiace Uul a r signaling pathway. 

ORPHAN RECEPTOR shall mean an endogenous re ce ptor for which the 
5 endogenous rigaod speriBc fen thai receptor has not been identified or is not known. 

PHARMACEUTICAL COMPOSITION shaD mean a composition 
co mprisin g at least one active ingredient, whereby the co mp o si tion is amenable to 
investigation for a specified, efficacious outcome in a mamma] (for example, and not 
limitation, a human). Those of ordinary skill in the art will understand and appreciate the 
10 techniques ap p ropr iate for determining whether an active ingredient has a desired 
efficacious outcome based upon the needs of the artisan. 

PLASMH) shall mean the comb ina tion of a Vector and cDNA. Generally, a 
Plasmid is introduced into a Host Cell for the purposes of replication a 
of the cDNA as a protein. 
15 SECOND MESSENGER shall mean an intracellular respons 

result of receptor activation. A second messenger can include, for example, inositol 
triphosphate (TPjX diacyegrycero] (DAG), cyclic AMP (cAMP), and cyclic GMP 
(cGMP). Second messenger response can be measured for a dcttjiitiml w ii of receptor 
activation. In addition, second messenger response can be measured for the direct 
20 identification of candidate compounds, i n c lu di n g for example, inverse agonists, agonists, 
partial a g o ni sts and antagonists. 

STIMULATE or STIMULATING, in relationship to the term "response" shall 
mean that a response is increased in the presence of a compound as opposed to in the 
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VECTOR in reference to cDNA shall mean a circular DNA capable of 
incorporating at least one cDNA and capable of incorporation into a Host Cell. 

The order of the following sections is set forth for presentational efficiency and 
is not intended, nor should be construed, as a limitation on the disclosure or the claims to 
5 follow. 

A* Introduction 

The traditional study of receptors has always proceeded from the a priori 
assumption (historically based) that the endogenous bgand must first be identified before 

10 discovery could proceed to find antagonists and other molecules that could affect the 
receptor. Even in cases where an antagonist might have been known (he search 
i mmedi ately extended to looking for the endogenous ligand. This mode of forking has 
persisted in receptor research* even after the discovery of constjtutively activated 
r e ceptors. What has not been heretofore recognized is that it is the active state of the 

15 receptor mat is most useful for discovering agonists, partial agonists, and inverse 
agonists of the receptor. For those diseases which result from an overly active receptor 
or an under-active receptor, what is desired in a therapeutic drug is a compound which 
acts to diminish the active state of a receptor or enhance the activity of the receptor, 
respectively, not necessarily a drug which is an antagonist to the end og enous ligand. 

20 This is because a compound that reduces or enhances the activity of the active receptor 
state need not bind at the same she as the endogenous ligand. Thus, as taught by a 
method of this invention, any search for therapeutic compounds should start by 



25 B. Identification of Human GPCRs 
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The efforts of the Human Genome project has led to the identification of a 
plethora of information regarding nucleic acid sequences located within the human 
genome; H has been the case in this endeavor that genetic sequence information has been 
made available without an understanding or recognition as to whether or not any 
5 particular genomic sequence does or may contain open-reading frame information that 
translate human proteins. Several methods of identifying nucleic acid sequences within 
the human genome are within the purview of those having ordinary skDl in the art For 
example, and not limitation, a variety of human GPCRs, disclosed herein, were 
discovered by reviewing the GenBank™ database. Table B, below, fists several 
10 endogenous GPCRs that we have discovered, along with other GPCR's that are 
homologous to the disclosed GPCR- 

TABLE B 



Dtasasad 
Orphan GPCR J 


Number 
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Open Reading 
Pram* . 
(Bas* Pain) 


Reference To 
HonMtogens 
GfCH 


Percent 
Homology 
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GPCR 


hRUPd 


All2l«i 


i.itebp 


NPV2ft 


2-ni 


hRUF9 


AC01 13371 


l,2oubp 


GAL2R 


2294 


hRUFlO 


AtiAWAS 


i,6i4bp 


CSaR . 


40% 


hRUPii 




l,Z72bp 


HM74 


36% 


hRUrtl 


AP000S08 




Masl 


34% 




ACOJ1780 




FttfaGPRX- 
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43% 


*kuri< 


AL137U* 


l,041bp 
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35% 




ALotua 


UiTbp 
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hRVPli 


AL 1361 06 


l.Ofcghp 


GLrioi 


37% 


hhupi'7 


AO023O78 




Masl 


37% 


ERUPTS 




l^05bp 


Oxytocin 


31% 


a»t)ri9 


AC026331 


1.041bp 


HM74 


52% - 


faRUPtt 


ALI6U58 


1,01 Ibp 


GPR34 


25*" 


hRUPii 


U -A&>26756- 


l,014bp 
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37% 
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993bp 


Masl 
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HRUP23 


AC607104 


1.092bp 


Rat GPR26 


31% 


hRvru 


AL355388 


l,12Sbp 


SALPR 


44% 


hRUP25 


At026iil 


l,092bp 


HM74 


95% 


bRUP26 


AC023040 


l,044bp 


Rabbit 5kTlb 


27% 


•RUP27 


ACtJ27o43 


158.700 


MOl 


38% 1 



receptors within the human body. As the patent document p rogre ss es, we will disclose 
techniques for mutati ng these receptors to f-Atnblish non-endogenous, constttunvely 
5 activated versions of these receptors. 

The techniques disclosed herein have also been applied to other human, orphan 
GPCRs known to the art, as will be apparent as the patent document progresses. 

C. Receptor Screening 

10 Screening candidate compounds against a non-endogenous, constitutively 

activated version of the human GPCRs disclosed herein allows for the direct 
identification of candida te compounds which act at this cell surface rec ep tor, without 
requiring use of the receptor's endogenous ligand. Using routine, and often 
commercially available techniques, one can determine areas within the body where the 

15 endogenous version of human GPCRs d isclos ed herein is expressed and/or over- 
expressed. It is also possible using these techniques to determine related 
disease/disorder states which are associated with the expression and/or over-expression 
of the receptor, such an approach is disclosed in this patent document 

With respect to creation of a mutation that may evidence constitutive activation 

20 of the human GPCR disclosed herein is based upon the distance from the proline r e sidue 
at which is presumed to be located within TM6 of the GPCR; this algorithmic technique 
is disclosed is co-pending and commonly assigned patent document PCT Application 
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# 

a ApnT20, 20 
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Number PCTAJS99/23938, published as WO 00/22129 on ApnTTO". 2000, which, along 
with the other patent documents listed heron, is incorporated herein by reference. The 
algorithmic technique is not predicated upon traditional sequence "alignment** but rather 
a specified distance from the af or eme n tioned TM6 proline residue (or, of course, 
5 endogenous constitutive substitutionf for such proline residue). By mutating the amino 
acid residue located 16 amino acid residues from this residue (presumably located in the 
IC3 region of the receptor) to, most preferably, a lysine residue, such activation may be 
obtained. Other amino acid residues may be useful in the mutation at this position to 
achieve this objective. 

10 



As will be set form in greater detail below, most preferably inverse agonists and 
agonists to the non-endogenous, constitutively activated GPGR can be identified by the 
methodologies of this invention. Such inverse agonists and agonists are ideal farKfidftT 

15 as lead compounds in drug discovery programs for treating diseases related to this 
receptor. Because of the ability to directly identify inverse agonists to the GPCR, 
thereby allowing for the development of pharmaceutical compositions, a search for 
diseases and disorders associated with the GPCR is relevant. For example, scanning 
both diseased and normal tissue samples for the presence of the GPCR row becomes 

20 more than an academic exercise or one which might be pursued along the path of 
identifying an endogenous ligand to die specific GPCR. Tissue scans can be conducted 
across a broad range of healthy and diseased tissues. Such tissue scans provide a 
preferred first step in associating a specific receptor with a disease and/or disorder. 

Preferably, the DNA sequence of die human GPCR is used to make a probe for 

25 (a) dot-blot analysis against tissue-mRNA, and/or (b) RT-PCR identification of the 
expression of the receptor in tissue samples. The presence of a receptor in a tissue 



source, or a diseased tissue, or the presence of the receptor at elevated concentrations in 
diseased tissue compared to a norma) tissue, can be preferably utilized to identify a 
correlation with a treatment regimen, including but not limited to, a disease associated 
with that disease. Receptors can equally well be localized to regions of organs by this 
5 technique. Based on the known functions of the specific tissues to which the receptor is 
localized, the putative functional role of the receptor can be deduced. 



1. Generic GPCR screening assay techniques 

10 When a G protein receptor becomes constitutively active, it binds to a G protein 

Gq, Gs, Gi. Gz, Go) and stimulates the binding of GTP to the G protein. The G 
protein then acts as a GTPase and slowly hydro ryzes the GTP to GDP, whereby the 
receptor, under normal conditions, becomes deactivated. However, constitutively 
activated recep t ors continue to exchange GDP to GTP. A non-hydrolyzable analog of 

15 GTP, [ 55 S]GTPyS, can be used to monitor enhanced binding to membranes which 
express constitutively activated receptors. It is reported that ["SJGTPyS can be used to 
monitor G protein coupling to membranes in die absence and presence of ligand. An 
example of this monitoring, among other examples well-known and available to those in 
the art, was reported by Traynor and Nahorski in 1995. The preferred use of this assay 

20 system is for initial screening of candidate compounds because die system is genetically 
applicable to all G protein-coupled receptors regardless of the particular G protein that 
interacts with the intracellular domain of the receptor. 

2. Specific GPCR screening assay techniques 

Once candidate compounds are identified using the "generic" G protein-coupled 
25 receptor assay (£«.. an assay to select compounds that are agonists, partial agonists, or 
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.' receptor site is preferred. For example, a compound identified by die "generic'' assay 

may not bind to the receptor, but may instead merely "uncouple" die G protein from the 

intracellular domain. 

a, Gs, Gz and Gi. 

5 Gs stimulates the enzyme adenyh/l cyclase. Gi (and Gz and Go), on the other 

hand, inhibit this enzyme. Adenylyi cyclase catalyzes the conversion of ATP to cAMP; 
thus, constitutively activated GPCRs that couple the Gs protein are associated with 
increased cellular levels of cAMP. On the other hand, constitutively activated GPCRs 
that couple Gi (or Gz, Go) protein are associated with decreased cellular levels of cAMP. 
10 See, generally, "Indirect Mechanisms of Synaptic Transmission," Chpt 8, From Neuron 
To Brain (3* Ed.) Nichols, J.G. etaleds. Sinauer Associates, Inc. (1992). Thus, assays 
that detect cAMP can be utilized to determine if a candidate compound is, e.g., an 
inverse agonist to die receptor (£«., such a compound would decrease the levels of 
cAMP). A variety of approaches known m the art for measuring cAMP can be utilized; 
15 a most preferred approach relies upon the use of anti-cAMP antibodies in an ELISA- 
based format Another type of assay that can be utilized is a whole cell second 
messenger reporter system assay. Promoters on genes drive the expression of the 
proteins that a particular gene encodes. Cyclic AMP drives gens expression by 
promoting the binding of a cAMP-responsive DNA binding protein or transcription 
20 factor (CREB) that then binds to the promoter at specific sites called cAMP response 
elements and drives the expression of the gene. Reporter systems can be constructed 
which have a promoter containing multiple cAMP response elements before the reporter 
gene, e.g.. p^galactosidase or luciferase. Thus, a constitutively activated Gs-linked 
receptor causes the accumulation of cAMP that then activates the gene and expression of 

17 
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the reporter protein. The reporter protein such as B-galact os ida s c or luciferase can then 
be detected using standard biochemical assays (Chen et al. 1 995). 
h. Go and Gq. 

5 Gq and Go arc associated with activation of die enzyme phosphotipase C, which 

in turn hydrolyzes die phospholipid PIP2, releasing two intracellular messengers: 
diacyctoglycerol (DAG) and irastol 1 ,4,5-mphoisphate (IPj). Increased accumulation of 
IPj is associated with activation of Gq- and Go-associated receptors. See, generally, 
'Indirect Mechanisms of Synaptic Transmission," Chpt. 8, From Neuron To Brain (3 rt 

10 Ed.) Nichols, J.G. et al eds. Sinauer Associates, Inc. (1992). Assays that detect TP, 
accumulation can be utilized to determine if a candidate compound is, e.g., an inverse 
agonist to a Gq- or Go-associated receptor (i.e., such a compound would decrease the 
levels of IPs). Gq- associa t ed receptors can also been examined using an API reporter 
assay in that Gq-dependent phospholipase C causes activation of genes containing API 

IS elements; thus, activated Gq-associated receptors will evidence an increase in the 
expression of such genes, whereby inverse agonists thereto will evidence a decrease in 
such expression, and agonists will evidence an increase in such expression. 
Commercially available assays for such detection are available. 
3. GPCR Fusion Protein 

20 The use of an endogenous, constitutively activate orphan GPCR or a non- 

endogenous, constitutively activated orphan GPCR, for use in screening of candidate 
compounds for the direct identification of inverse agonists, agonists and partial agonists 
provide an interesting screening challenge in that, by definition, (he re ce ptor is active 
even in the absence of an endogenous ligand bound (hereto. Thus, in order to 

25 differentiate between, e.g., the non-endogenous receptor in the presence of a candidate 
compound and the non-endogenous receptor in the absence of that compound, with an 

18 



WO 01/3*47] 

aim of such a differentiation to aHow far in understanding as to whether such compound 
may be an inverse agonist, agonist, partial agonist or have no affect on such a receptor, it 
is preferred that an approach be u tili zed that can e nh a n ce such differentiation. A 
preferred approach is the use of a GPCR Fusion Protein. 
5 Generally, once it is determined that a non-endogenous orphan GPCR has been 

constitutively activated using the assay techniques set forth above (as well as others), it 
is possible to determine the predominant G protein that couples with the endogenous 
GPCR. Coupling of the G protein to the GPCR provides a signaling pathway that can be 
assessed. Because it is most preferred that screening take place by use of a mammalian 
10 erpressiQn system, such a system wiD be expected to have endogenous G protein therein. 
Unas, by definition, in such a system, the non-endogenous, constitute vdy activated 
orphan GPCR will continuously signal, to uns regard, it is preferred that this signal be 
enhanced such that in the presence of. eg, an inverse agonist to the receptor, it is mare 
likely that it wiD be able to more readily differentiate, particularly in the context of 

15 



The GPCR Fusion Protein is intended to enhance the efficacy of G j 
coupling with the non-endogenous GPCR. The GPCR Fusion Protein is preferred for 
screening with a non-endogenous, constinitivery activated GPCR because such an 
approach increases the signal mat is most preferably utilized in such screening 
techniques. This is important in facilitating a significant "signal to noise" ratio; such a 
significa nt ratio is import preferred for the screening of ran^ i^ ar tf compounds as 

The construction of a construct useful far expression of a GPCR Fusion Protein 
is within the purview of those having ordinary skill in the art Commercially available 
expression vectors and systems offer a variety of approaches mat can fit the particular 



needs of an investigator. The criteria of importance for such a GPCR Fusion Protein 
construct fa that the endogenous GPCR sequence and the G protein frgugncft both be ro- 
fiaroe (preferably, the sequence for the endogenous GPCR is upstream of the G protein 
sequence) and that the "stop" codon of the GPCR must be deleted or replaced such that 
upon expression of the GPCR. the G protein can also be e xp re s se d . The GPCR can be 
linked directly to the G protein, or there can be spacer residues between the two 
(preferably, no more man about 12, although this number can be readily ascertained by 
one of ordinary skill in the art). We have a preference (based upon convenience) of use 
of a spacer in that some restriction sites that are not used will, effectively, upon 
expre ss ion, become a spacer. Most preferably, the G protein that couples to the non- 
endogenous GPCR will have been identified prior to the creation of the GPCR Fusion 
Protein construct. Because there are only a few G proteins mat have been identified, it is 
pre ferre d that a construct comprising the sequence of the G protein (Le., a universal G 
protein construct) be available for insertion of an endogenous GPCR sequence therein; 
this provides far efficiency in the context of large-scale screening of a variety of 
different endogenous GPCRs having different sequences. 

As noted above, constitutively activated GPCRs that couple to Gi, Gz and Go aze 
expected to inhibit the formation of cAMP making assays based upon these types of 
GPCRs challenging (i.e., the cAMP signal decreases upon activation thus making the 
direct identification of, eg, inverse agonists (which would further decrease this signal), 
interesting. As will be disclosed herein, we have ascertained that for these types of 
receptors, it is possible to create a GPCR Fusion Protein that is not based upon the 
endogenous GPCR's endogenous G protein, in an effort to establish a viable cy cl as e 
based assay. Thus, for example, an endogenous Gi coupled receptor can be fused to a Gs 
protein - we believe that such a fusion construct, upon expression, "drives" or "forces" 
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the endogenous GPCR to couple with, «^., Gs rather than the "natural" Gi protein, such 
that a cyclase-based assay can be established. Thus, far Gi, Gz and Go coupled 
receptors, we prefer that that when a GPCR Fusion Protein is used and the assay is based 
upon detection of adenytyl cyclase activity, fast the fusion construct be established with 
5 Gs (or an equivalent G protein mat stimulates the formation of the enzyme adenylyl 
cyclase). 

Equally effective is a G Protein Fusion' construct that utilizes a Gq Protein fused 
with a Gs, Gi, Gz or Go Protein. A most p re f erred fusion construct can be accomplished 
with a Gq Protein wherein the first six (6) amino acids of the G-protein a-subunit 

1 0 ("Gaq") is deleted and the last five (5) amino acids at the C-teiminal end of Gaq is 
replaced with the corresponding amino acids of the Go of the G protein of interest. For 
example, a fusion construct can have a Gq (6 amino acid deletion) fused with a Gi 
Protein, resulting in a "Go/Gi Fusion Construct". We believe mat tins fusion construct 
will force the endogenous Gi coupled receptor to couple to its non-endogenous G 

IS protein, Gq, such that the second messenger, for example, inositol triphosphate or 
diacylgycerol, can be measured in lieu ofcAMP production. 

4. Co-tranifectton of a Target Gi Coupled GPCR with a Signal- 
Enhancer Gs Coupled GPCR (cAMP Based Assays) 

20 

A Gi coupled receptor is known to inhibit adenytyl cyclase, and, therefore, 
decrease the level of cAMP production, which can make assessment of cAMP levels 
challenging. An effective technique in measuring the decrease m production of cAMP as 
en indication of constitutive activation of a receptor mat predominantly couples Gi upon 
25 activation can be accomplished by co-tnmsfecting a signal enhancer, eg., a non- 
endogenous, constitutively activated receptor that p redo min antly couples with Gs upon 
activation (eg., TSHR-A623I, disclosed below), with the Gi linked GPCR. As is 
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apparent, constitutive activation of a Gs coupled receptor can be determined based upon 
an increase in production of cAMP. Constitutive activation of a Gi coupled receptor 
leads to a decrease in production cAMP. Thus, the co-txansfection approach is intended 
to advantageously exploit these "opposite" affects. For example, co^tnmsfection of a 
5 non-endogenous, constitutively activated Gs coupled re ce pt ui (the "signal enhancer") 
with the endogenous Gi coupled receptor (the "target receptor") provides a baseline 
cAMP signal (Le., although the Gi coupled receptor, will decrease cAMP levels, this 
"decrease" will be relative to the substantia] increase in cAMP levels established by 

J 

constitutively activated Gs coupled signal enhancer). By men co-transfecting the signal 
10 enhancer with a constitutively activated version of the target receptor, cAMP would be 
expected to further decrease (relative to base line) due to the increased functional activity 
of the Gi target (£«., which decreases cAMPk 

Screening of candidate compounds using a cAMP based assay can then be 
accomplished, with two provisos: first, relative to the Gi coupled target receptor, 
15 "opposite" effects will result, i.e., an inverse agonist of the Gi coupled target receptor 
will increase the measured cAMP signal, while an agonist of the Gi coupled target 
receptor will decrease this signal; second, as would be apparent, candidate compounds 
) that are directly identified using this approach should be assessed independently to 

ensure that these do not target the signal enhancing receptor (this can be done prior to or 

F. M cd Ideal Chemistry 

Generally, but not always, direct identification of candidate compounds is 
preferably conducted in conjunction with compounds generated via combinatorial 
25 chemistry techniques, whereby thousands of compounds are randomly prepared for 
such analysis. Generally, the results of such screening will be compounds having 
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unique core ttructurcs; thereafter, these compounds are^pwRfabry subjected to 
additional chemical modification around a preferred core structures) to further 
enhance the medicinal properties (hereof. Such techniques are known to those in the 
art and will not be addressed in detail in this patent document. 



pharmaceutical compositions using techni ques well known to those in the ait. Suitable 
pharmaceutical ly-o cc cptablc earners are available to those in the ait; for example, see 
10 Remington's Pharmaceutical Sciences, 16* Edition, 1980, Mack Publishing Co., (Oslo 
et aL r eds.). 

, H. Other Utility 

Although a preferred use of the non-endogenous versions the human GPCRs 
15 • disclosed herein may be for the direct identification of candid at e compounds as inverse 
agonists, agonists or partial agonists (preferably for use as pharmaceutical agents), these 
versions of human GPCRs can also be utilized in research settings. For example, in vitro 
and in vivo systems incorporating GPCRs can be utilized to further elu cidat e and 
understand the roles- these receptors play in the human condition, both normal and 
20 diseased, as well as understanding the role of constitutive activation as it applies to 
g cascade. The value in non-endogenous human GPCRs is 
h tool is enhanced in that , b wi ii we of then - unique features, 
nan-endogenous hiiniiiH GPCRs can be used to understand the role of these receptors in 
the human body before die endogenous hgand therefore is identified. Other uses of die 
25 disclosed receptors will become apparent to those in die art based upon, inter olio, a 
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EXAMPLES 

The following examples are presented far purposes of elucidation, and not 
limitation, of the present invention. While specific nucleic add and amino acid 
sequences are disclosed herein, those of ordinary skill in the art are credited with the 
5 .ability to make minor modifications to these sequences while achieving the same or 
s u b sta n tially similar results reported below. The traditional approach to application or 
understanding of sequence cassettes from one sequence to another (eg from rat receptor 
to human receptor or from human receptor A to human receptor B) is generally 
predicated upon sequence alignment techniques whereby the sequences are aligned in an 

10 effort to determine areas of commonahty. The mutational approach disclosed herein 
does not rely upon this approach but is instead based upon an algorithmic approach and a 
positional distance from a conserved proline residue located within the TM6 region of 
human GPCRs. Once this approach is secured, those in the art are credited with the 
ability to make minor modifications thereto to achieve substantially the same results (ie, 

15 constitutive activation) disclosed herein. Such modified approaches are considered 
within the purview of this disclosure. 



II 
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1. Identification of Human GPCRs 
The disclosed endogenous human GPCRs were identified based upon a review 
25 of the GcnBank™ database information. While searching the database, the following 
cDNA clones were identified as evidenced below (Table C). 



Orpins 
GPCRs 


Accession 
Number 
Identified 


CMBptctsDNA 
Sequence 
(Base Pairs) 


Open Kcsdiog 
Frone 
(Base Pain) 


Nucleic 

Add 
SEQJD. 

. NO. 


Abu no 
Add 

SEQJD. 
NO. 


KRUP8 


AL1217S5 




1.152bp 


» 


2 


fckUP9 


AC0ll337i 




U60bp 


3 


4 


hRlJPlO 


AC008745 




l,014bp 


3 


6 


bRUFll 


Attitttt 




1.272bp 


1 


« 


hRUPH 


AP000S6S 


-I77,7«bp 


966bp 


9 


10 


bRurrt 


Acorns 


lfl^t9bp 


U56bp 


II 


12 


bkupi4 


AtiS7ii8 


168^97bp 


l,041bp 


13 


14 


hfcUFlS 


AL016468 


l3U2Sbp 


1.527bp . 


15 


16 


KRUPJ6 


AL136106 


208,042bp 


l,068bp 


l7 


it 


hRttF17 


ACO23078 


l6'i.735bp 


969bp 


IS 


20 " 


hkupis 


AC008547 


I17,304bp 




21 


22 


MWP19 


AC02633 1 


145.163bp 


l,041bp 


23 


24 


— ESCP20-- 


AUdUia 


liS^nbp 


i,oi ibp 


23 


26 


hRUKl 




U*4M«, 


l.014bp 


21 


2i 


hRUKU 


AC027624 


151.SI Ibp 


993bp 


» 


30 


bHUPXJ 


ACOWioi ■ 


200,000bp 


l,092bp 


31 ■ " 


32 




ALU53&8 ' 


l90338bp 


».125bp 


33 


34 




AOtttiJt ' 


141l83bp 


1,092 bp 


53 


36 


kRitrrt 


ACO23O40 


178408bp 


].044bp 


W 


38 " 




AC027<U3 


iSS,7b0bp 


i,020bp 


» 


46 



2. FnD Length Cloning 
a. hRUP8fSeq.Id.Ncs.lA2) 
The disclosed human RUP8 was identified based upon the use of EST database 
(dfc£$7) rnfbrrnstion. While searching the dbEST, a cDNA ct 
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AL121755 was identified to encode a novel GPCR. The following PCR primers were 
used for RT-PCR with human testis Marathon-Ready cDNA (Oontech) as templates: 
5 '-CTTGCAGACATCACCATGGCAGCC-3 ' (SEQJD.NO-:41; sense) and 
5 '-GTGATGCTCTGAGTACTGGACrGQ-3' (SEQ.ID.NO.: 42; antiseme). 
PCR was performed using Advantage cDNA polymerase (Ckmtech; manufacturing 
instructions will be followed) in 50ul reaction by the following cycles: 94°C for 30 sec; 
94-C for 10 sec; 65°C for 20 sec, 72°C for 1.5 mm, and 72*C for 7 min. Cycles 2 
through 4 were repeated 35 times. 

A l_2kb PCR fragment was isolated and cloned into the pCRD-TOPO vector 
(Invitrogen) and sequenced using the ABI Big Dye Terminator lot (PJL Biosystem). 
See, SEQ.ID.NO-: 1. The putative amino acid sequence for RUP8 is set forth in 
SEQ.ID.NO. :2. 

b. hRUP9 (Seq. Id. Nos. 3 & 4) 

The disclosed human RUP9 was identified based upon the use of GeneBank 
database information- While searching the database, a cDNA clone with Accession 
Number AC01 1375 was identified as a human genomic sequence from chromosome 
5. The full length RUP9 was cloned by PCR using primers: 
5 '^G AAGCTGTGAAGAGTGATGC-3 ' (SEQJD^Oj43; sense), 
5 •-CTCAGCAATATTG ATAAGCAGCAG-3 ■ (SEQID.NO.:44; antiserum) 
and human genomic DNA (Promcga) as a template. Taq Plus Precision polymerase 
(Stratagems) was used for the amplification m a lOOul reaction with 5% DMSO by the 
following cycle with step 2 to step 4 repeated 35 times: 94«C for 1 minute; 94 "C for 
30 seconds; 56°C for 30 seconds; 72 "C for 2 minutes; 72*C for 5 minutes. 

A 13 Kb PCR fragment was isolated and cloned into the pCRD-TOPO vector 
(Irrvftrogen) from 1% agarose gel and completely sequenced using the ABI Big Dye 



! 
! 



WO 010*47] 

Terminator kit (P.E. Biosystem). See, SEQ.ID.NO.:3. 



The^^atrve amino acid 



sequence for RUP8 is set forth in SEQJD.NO.:4. The sequence of RUP9 clones isolated 
from human genomic DNA matched with the sequence obtained from data base, 
c bRUP10(Scq.Id.Nos.5&6) 
The disclosed human RUP10 was identified based upon the use of GenBank 
ormanon. While searching the database, a cDNA clone with accession 
AC008754 was identified as a human genomic sequence from chromosome 
19. The full length RUP1 0 was cloned by RT-PCR using primers: 
5 •-OCATGGGGAACGATTCTGTCAGCTACG-3 1 (SEQ.ID.NO.:45; sense) and 
10 5*-GCTATGCCTGAAGCCAGTCTTGTG-3 ' (SEQJD.NO.:46; antisense) 

and human leukocyte Marathon-Ready cDNA (Cloniech) as a template. Advantage 
cDNA polymerase (Cloniech) was used for the amplification in a 50ul reaction by the 
following cycle with step 2 to step 4 repeated 35 times: 94°C for 30 seconds; 94*C 
for 10 seconds; 62*C for 20 seconds; 72°C for 1 .5 minutes; 72°C for 7 minutes. A 1.0 
15 Kb PCR fragment was isolated and cloned into the pCRH-TOPO vector (Invitrogen) 
and completely sequenced using the ABI Big Dye Terminator kit (P.E. Biosystem). 
The nucleic acid sequence of the novel human receptor RUP10 is set forth in 
SEQID.NO.:5 and the putative amino acid sequence thereof is set forth in 
SEQJD.NO.:6. 

20 

d. hRUPl 1 (Seq. Id. Nos. 7 4 8) 
The disclosed human RUP1 1 was identified based upon the u 
database information. While searching the database, a cDNA clone 
25 number AC01 3396 was identified as a human genomic sequence from 



PCT/WSOOOlso* 
The full length RUPl I was cloned by PCR using primers: 
5 '-CCAGGATGTTGTGTCAOOGTGGTGGC-3' (SEQJD.NO.:47; sense). 
5'<^CAGCGCTf3CACXXXnGCAGCrGGC-3* (SEQJD.NO.:48; antisense) 
and human genomic DNA (Clem tech) as a template. TaqPhis Precision DNA 
polymerase (Stratagene) was used for the amplification in a 50ul reaction by the 
foDowing cycle with step 2 to step 4 repealed 35 times: 94°C for 3 rninutcs; 94"C for 20 
seconds; 67°C for 20 seconds; 72*C for 1 .5 minutes, 72^ fox 7 minutes. A 1 J Kb PCR 
fragment was isolated and cloned into the pCRH-TOPO vector (Invitrogen) and 
completely sequenced using the ABI Big Dye Terminator kit (P.E. Biosystem). The 
nucleic acid sequence of the novel human receptor RUPl t is set form in SEQJD>K>.:7 
o acid sequence thereof is set forth in SEQJD.NO.:8. 
c hRUP12(S*q.Id.Nos.9&10) 
The disclosed human -RUPl 2 was identified based upon the use of GenBank 
While searching the database, a cDNA clone with accession number 



AP000808 was identified to encode a new GPCR, having significant homology with rat 
RTA and human masl oncogene GPCRs. The full length RUPl 2 was cloned by PCR 



5 '-CTTCCTCTOGTAGGGATGAACCAGAC-3 1 {SEQJD.NO.:49; sense) 
5 '<TCG<^C>LG<nXXK3AAGCACCTGTGO-3 • (SEQ.ID.NO.^0; m 
and human genomic DNA (Oorrtech) as template. TaqPlus Precision DNA porymerase 
(Stratagene) was used for the amplification by die following cycle with step 2 to step 4 
repeated 35 times: 94"C for 3 min; 94°C for 20 sec; 65°C for 20sec; 72°C for 2 rnin and 
72*C for 7 rnin. A l.Okb PCR fragment was isolated and cloned into the pCRH-TOPO 
vector (Invitrogen) and completely sequenced using the ABI Big Dye Terminator kit 
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PCT/USOO/31509 
and SEQ.ID.NO.:10 for 



(P.E. Biosystem) (see. SEQ-ID.NO.:9 for nucleic acid 
deduced amino acid sequence). 

£ bRUP13(Seq.Id.Nos.ll&12) 

The disclosed human RUPl 3 was identified based upon the use of 
GenBank database. While searching flic database, a cDNA clone with accession number 
AC011780 was identified to encode a new GPCR, having significant homology with 
GPCR fish GPRX-ORYLA. The full length RUPl 3 was cloned by PCR using primers: 
5 , -GCCTGTGA<^GGAGGTACCCTGO-3' (SEQ.ID.NO.:5 1 ; sense) 
5 , ^TATCCClXXX3AGTOTCCAGCGGC- 3 ' (SEQ.ID.NO.:52; antisense) 
and human genomic DNA (Oontech) as template. TaqPhn Precision DNA polymerase 
(Stratagene) was used for the amplification by the following cycle with step 2 to step 4 
repeated 35 times: 94°C for 3 rnin; 94°C for 20 sec; 65*C for 20sec; 72*C for 2 rnin and 
72*C for 7 rnin. A 1 J5kb PCR fragment was isolated and cloned into the pCRH-TOPO 
vector (Invitrogen) and completely sequenced using the ABI Big Dye Terminator kit 
(Pi. Biosystem) (see. SEQ ID-NO.: 11 for nucleic acid sequence and SEQ.ID.NO.:12 
for deduced amino acid sequence). 

g, hRUP14 (Seq. Id. Nos. 13 St 14) 
The disclosed human RUPl 4 was id 
database information. While searching the database, a cDNA clone with Aco 
Number AL 1371 18 was identified as a hue 
1 3. The full length RUP14 was cloned by PCR u 
S'-GCATGOAGAGAAAATTTATOTCCTTOCAACC^' (SEQJD.NO.J3; sense) 
5 •<^GAACAGGTCTCATCTAAGAGCTCC-3 • (SEQJDJ>IO.:54; antisense) 
and human genomic DNA (Promega) as a template Taq Plus Precision polymerase 
(Stratagene) and 5% DMSO were used for the amplification by the following cycle 
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with step 2 and step 3 repeated 35 times: 94°C for 3 minute; 94°C for 20 s 
58«C for 2 minutes; 72*C for 1 0 rninutcs. 

A 1.1 Kb PCR fragment was isolated and cloned into the pCRH-TOPO vector 
(Invitrogen) and completely sequenced using the ABI Big Dye Terminator kit (P£. 
Biosystem) (see. SEQ.ID.NO.:13 for nucleic acid sequence and SEQJD.NO.:14 for 
deduced amino acid sequence). The sequence of RUPl 4 clones isolated from human 
genomic DNA matched with the sequence obtained from database, 
b. hRUPl 5 (Seq. Id. Nos. IS A 16) 

The disclosed human RUPl 5 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA close with Accession 
Number A CO 16468 was identified as a human genomic sequence. The full length 
RUPl 5 was cloned by PCR using primers: 

5 r -GCTGTTGCCATGACGTCCACCTGCAO-y (SEQ.ID.NO.:55; sense) 

5 '^GACAGTTCAAGGTTrOCCTTAGAAC-3' (5EQJD.NO.36; antisense) 

and human genomic DNA (Promega) as a template. Taq Plus Precision polymerase 

(Stratagene) was used for the amplification by the following cycle with step 2 to 4 

repeated 35 times: 94»C for 3 inmute; 94°C for 20 seconds; 65 "C for 20 seconds; 

72 *C for 2 minutes and 72*C for 7 minutes. 

A 1.5 Kb PCR fragment was isolated and cloned into the pCRQ-TOPO vector 
(Invitrogen) and completely sequenced using foe ABI Big Dye Terminator kit (P EL 
Biosystem). See, SEQ JD .NO: 13 for nucleic acid sequence and SEQJDJvIO.:16 for 
> acid sequence. The sequence of RUPl 5 c 
cDNAn 

L tiRUP16(Seq.Id.No*.I7&18) 



3© 



per 

The disclosed human RUP16 was identified based u^iffhe nse of GeneBank 
database information. While searching the database, a cDNA close with Accession 
Number AL 1 36 1 06 was identified as a human genomic sequence I 
13. The full length RUP1 6 was cloned by PCR using prima*: 
5 5*4-1 1 1 eXJATACTOCTtXT ATGCTC-3 * cense, 5* of in 

5 '-GTAGltXACTGAAAGTOCAGTOATOC-3 ' (SEQJD.NO.:58; anttsense. 3" of stop codon) 
and human skeletal muscle Marathon-Ready cDNA (Clontech) as template. Advantage 
cDNA polymerase (Clontech) was used for the amplification hi a SOul reaction by the 
following cycle with step 2 to 4 repeated 35 times: 94*C for 30 seconds; 94«C for 5 
10 seconds; 69°C for 15 seconds; 72*C for 1 minute and 72*^ for 5 minutes. 

A 1.1 Kb PGR fragment was isolated and closed into the pCRD-TOPO vector 
(Invitrogen) and completely sequenced using the 17 sequenase kit (Amsham). See, 
SEQJDJMO.:17 for nucleic acid sequence and SEQJD.NO.:18 for deduced amino acid 
. sequence. The sequence of RUP16 clones matched with four unentered segments of 
15 ■ AL136106, indicating that the RTJP16 cDNA is composed of 4 eaons. 
J. hRUP17(Seo.Id.Nos.l9A20) 
The disclosed human RUP17 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone with Accession 
Number AC023078 was identified as a human genomic sequence from chromosome 
20 11. The full length RUP17 was closed by PCR using primers: 

5 *-TTTCTGAGCATGGATXXAAOCATCTC-3* (SEQJD.NO.:S9; sense, c 



5 '-CTGTCTCACAGGGCAG AGGCKTTTC-3 * (SEQJD_NO.:60; antisense, 3' of stop codec) 
and human genomic DNA (Promega) as template. Advantage cDNA polymerase mix 
25 (Clontech) was used for the amplification in a 1 OOul reaction with S*A DMSO by the 



following cycle with step 2 to 4 repeated 30 times: 94»C for 1 min; 94°C for 1 5 sec; 
67*C for 20 sec; T2*C for 1 mm and 30 sec; and 72*C for 5 min. 

A 970bp PCR fragment was isolated from 1% agarose gel and closed into the 
pCRH-TOPO vector (mvitrogen) and completely sequenced using the ABI Big Dye 
5 Tcrmiantor Kit (P.E. Biosystem). See. SEQJD.NO.:19 for nucleic acid sequence and 
SEQJD»O :20 for deduced amino acid sequence. 

It. bRUP18(Seq.Id.Nos.21&22) 
The disclosed human RUP18 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone 
10 Number A COOS 547 was identified as a human genomic sequence from c 
5. The full length RUP1 8 was cloned by PCR using primers: 
5M3CJAACTCGTATAGACCCAG43GTCGCTCC-3* (SEQJD.NO.:61; sense, 5* of the 



5 *-<X3AGGTTGCGCCTTAGCCACAGATGACC-3* (SEQJDJ<Oj62; armsense, 3* of stop 
codon) 

and human genomic DNA (Promega) as template. TaqPlus precision DNA 
polymerase (Stratagene) was used for the amplification in a lOOul reaction with 5% 
DMSO by the following cycle with step 2 to 4 repeated 35 times: 95 *C for 5 mis; 
95 "C for 30 sec; 65 *C for 30 sec; 72°C for 2 min; and 72°C for 5 min. 

A 1.3kb PCR fragment was isolated from 1% agarose gel and cloned into the 
pCRD-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
Temuantor Kit (P.E. Biosystem). See, SEQ.ID.NO.:21 for nucleic acid sequence and 
SEQJD.NO.:22 for deduced amino acid sequence. 

L bRUP19(Seq.Id.Nos.23&24) 
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The disclosed human RUP19 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA clone with Accession 
Number AC026331 was identified as a human genomic sequence from chromosome 
12. The full length RUP19 was cloned by PCR using primers: 
5'-CTGCA<XXXX^CACT1XiCTCtO-3' (SHQJD-NO :63; sense, 5' of initiation codon), 
5 '-GTCTGC'l 1U 1 1CAGTGOCACTCAAC-3 • (SEQJDJsJO.:64; antisense, comaining the stop 



and human genomic DNA (Promega) as template. TaqPms Precision DNA 
polymerase (Stratagene) was used for the amplification with 5% DMSO by the 
10 following cycle with step 2 to 4 repeated 35 times: 94 *C for 1 min; 94 "C for 15 sec; 
70*C for 20 sec; 72°C for 1 min and 30 sec; and 72"C for 5 min. 

A 1.1 kp PCR fragment was isolated from 1% agarose gel and closed into the 
pCRTi-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
Termiantor Kit (P.E. Biosystem). See, SEQ.ID.NO.:23 for nucleic acid sequence and 
1 5 SEQJD.NO.:24 for deduced ammo acid sequence. 

m. bRUP20 (Seq. Id. Nos. 25 [A 16) 
The disclosed human RUP20 was identified based upon the use of GeneBank 
While searching the database, a cDNA clone with Accession 
r AL161458 was identified as a human genomic sequence from chromosome 
20 1. The full length RUP20 was cloned by PCR using primers: 

5 '-TATCTGCAAT7CTATTCTAGCTOCTO-3 • (SEQJD_KO.:65; sense, 5' of initiation codon), 
5 -TGTCCCTAATAAAGTCACATGAATGC-3* (SEQJDJJOJ66; antisense, 3' of stop codon) 
and human genomic DNA (Promega) as template. Advantage cDNA polymerase mix 
(Clonetech) was used for the ampHficafion with 5% DMSO by the following cycle with 
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step 2 to 4 repeated 35 times: 94*Cfor 1 min; 94*C for 15 sec; 60°C for 20 sec; 72*C 
for 1 min and 30 sec; and 72°C for 5 min. 

A 1.0 kp PCR fragment was isolated from 1% agarose gel and dosed into the 
pCRH-TOPO vector (mvitrogen) and completely sequenced using the ABI Big Dye 
5 Tcrmiantor Kit (P.E. Biosystem). See, SEQ.ID.NO.:25 for nucleic acid sequence and 
SEQ.ID.NO. :26 for deduced amino acid sequence. 

D. bRUPll (Seq. Id. Nos. 27 & 2S) 
The disclosed human RUP21 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA done with Accession 
10 Number AC026756 was identified as a human genomic sequence from chromosome 
13. The full length RUP21 was closed by PCR using primers: 
5*- GGAGACAACCATGAATGAGCCAC -3 ' (SEQ.ID.NO.:67; sense) 
5'- TATTTCAAGGGTTGTTTCAGTAAC -3 ' (SEQJD.NO.:68; antisense) 
and human genomic DNA (Promega) as template. Taq Plus Precision polymerase 
15 (Stratagene) was used for the amplification in a lOOul reaction with 5% DMSO by the 
following cycle with step 2 to 4 repeated 30 times: 94°C for 1 min; 94"C for 15 sec; 
55"C for 20 sec; 72*C for 1 min and 30 sec; and 72*C for 5 min. 

A 1 ,014 bp PCR fragment was isolated from 1% agarose gd and cloned into the 
pCRD-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
20 Termiantor Kit (P.E- Biosystem). See. SEQJD.NO.^7 for nucleic acid sequence and 
SEQJD.NO. :28 for deduced amino add sequence. 

c hRUP22 (Seq. Id. Nos. 29 A 30) 
The disclosed bursas RUP22 was identified based upon the use of GeneBank 
database information. While searching the database, a cDNA close with Accession 
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Number AC027026 wn identified as a human genomic sequence 
1 1. The fell length RUP22 was cloned by PCR using primers: 
5 *- GGCACCAGTGGAGo 1 1 1 iCTCAG CATO -3' {SEQUXNO.: 69; 



5 5 '-CTCATGOAAGTAGAGGCTO 1 CCATCTC»3 • (SEQJD.NO.:70; antisense, 3' of stop 



omic DNA (Promega) as template. TaqPhn Precision DNA polymerase 
(Stratagene) was used far the ampliiicanon in a lOOul reaction with 5% DMSO by the 
following cycle with step 2 to 4 repeated 30 limes: 94% 1 minutes 94%, 15 seconds 
10 55%, 20 seconds 72% 1.5 minute 72% 5 minutes. 

A 970bp PCR fragment was isolated from 1 % agarose gel and cloned into the 
pCRU-TQPO vector (Invitrogen) and completely sequenced using the AB1 Big Dye 
Tcrraiantor Kit (P-E. Biosystcm). See, SEQJDJNO.^9 for nucleic acid sequence and 
SEQJD.NO.:30 for deduced amino acid sequence. 

p. hRUP23(Seo,Id.Nos.3] A 32) 
The disclosed human RUP23 was identified based upon the use of OeneBank 
da t abas e information. While searching the database, a cDNA clone witl 
Number AO007104 was identified as a human genomic sequence from c 
4. The full length RUP23 was cloned by PCR using primers: 
5 '-CXrroGCOAGCCGCTAGCGOCATG-3 ' {SEQJDJ«>j71; tense, ATO as 



5 '-ATGAOOCC1 GOC>.QGCCCrCAGT-3 ' (SEQJDJ40j72; antisense, TCA as me stop 
codon) 

and human placenta Marathon-Ready cDNA (Ckmtech) as template. Advantage cDNA 
polymerase (Oontech) was used for the amplification in a 50u] reaction by the following 
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cycle with step 2 to 4 repealed 35 timer 95% for 30 sec; 95*C for 15 sec; 66% for 20 
sec; 72% for 1 min and 20 sec; and 72% for 5 min. 

A 1.0 kb PCR fragment was isolated and cloned into the pCRD-TOPO vector 
(Invitrogen) and completely sequenced using the ABI Big Dye Terminal or Kit (PJi 
Biosystcm). See, SEQJD.NO.31 for nucleic acid sequence and SEQ.ID.NO.J2 for 



q. bRUP24 (Seq. Id. Nos. 33 A 34) 

The disclosed human RUP25 was identified based upon the use of GencBank 
database information. While searching the database, a cDNA done with Accession 
Number AC02633 1 was identified as a human genomic sequence from chromosome 
1 2. The full length RUP25 was cloned by PCR using primers: 
5 '-GCTGGAGCATTCACTAGGOGAO-3 * (SEQJD-NO.:73; sense. 5'cf initiation codec* 
5*-AGATCCrGGTTCTTGGTfJACAATO-3* (SEQJD.NO.:74; antisense, 3' of stop codon) 
and human genomic DNA (Promega) as template. Advantage cDNA polymerase mix 
(Oontech) was used for the amplification with 5% DMSO by the following cycle with 
step 2 to 4 repeated 35 times: 94% for 1 minute; 94% for 15 seconds; 56°C for 20 
seconds 72% for 1 minute 30 seconds and 72% for 5 minutes. 

A 1.2kb PCR fragment was isolated from 1% agarose gel and cloned into the 
pCRn-TOPO vector (Invitrogen) and completely s oqiiciwx d using the ABI Big Dye 
Terrniantor Kit (P.E. Biosystcm). See, SEQJD.NO. 33 for nucleic acid sequence and 
SEQJD.NO. -M for deduced amino acid sequence. . 
r. hRUP25 (Seq. Id. No*. 35 & 3*) 

The disclosed human RUP25 was identified based upon the use of GencBank 
database information. While searching the database, a cDNA clone with Accession 
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Number AC026331 was identified as a human genomic sequence from chromosome 
12. The full length RUP25 was cloned by PCR using primers: 
5 , -GCTGGAOCATTCACTAOGCGAG-3 ' (SEQJD.NO.:75; sense, 5*of nuance) codonX 
5 *-AGATCCITKjTTCTTGGTGACAATG-3* (SEQID.NO.: 76; antisense, 3* of stop codon) 
5' ■ and human genomic DNA (Promega) as template. Advantage cDNA polymerase mix 
(Oontech) was used for the amplification with 5% DMSO by the following cycle with 
step 2 to 4 repeated 35 times: 94% for 1 minute; 94% for 15 seconds; 56°C for 20 
seconds 72% for 1 minute 30 seconds and 72% for 5 minutes. 

A 1 .7kh PCR fragment was i s ol ate d from 1% agarose gel and cloned into the 
10 pCRD-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
Terrniantor Kit (PX. Biosystcm). See, SEQJD.NO. J 5 for nucleic acid sequence and 
SEQJD.NO. J6 for deduced amino acid sequence. 

s. bRUP26(Seq.Id.Nos.37A38) 
The disclosed human RTJP26 was identified based upon the use of GencBank 
database information. While searching the database, a cDNA clone with Accession 
Number AC023040 was identified as a human genomic sequence from chromosome 
2. The full length RUP26 was cloned by RT-PCR using RUP26 specific primers: 
S--AGCCATOCCTGCCAOGAAGCATGG-3' (SEQJD.NO.:77; sense, containing initiation 
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20 5 '-OC>GACTOTOOACTCAAGAACTCTAGO-3 ' (SEQJDNO.:7S; antisense, containing stop 



and human pancreas Marathon - Ready cDNA (Oontech) as template. Advantage cDNA 
polymerase mix (Oontech) was used for the amplification in a lOOul reaction with 5% 
DMSO by the following cycle with step 2 to 4 repeated 35 times: 94% for 5 minute; 
25 95% for 30 seconds; 65% for 30 seconds 72% for 2 minute and 72% for 5 minutes. 
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A 1.1 kb PCR fragment was isolated from 1% agarose gel and cloned into the 
pCRII-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
Terrniantor Kit (P.E. Biosystcm). See, SEQJDJJO. :37 for nucleic acid sequence and 
SEQJD.NO.: 38 for deduced amino add sequence. 
5 t. hRUP27 (Seq. Id. Nos. 39 & 40) 

The disclosed human RUP27 was identified based upon the use of GencBank 
database information. While searching the database, a cDNA clone with Accession 
Number AC027643 was identified as a human genomic sequence from chromosome 
12. The full length RUP27 was cloned by PCR using RUP27 specific primers: 
10 5 ^AGTOCAQGAACAATGAATOCATTTCATQ-3' (SEQ.ID.NO.:79; sense, containing 
tnination codon), 

5*-ATCATCTXrrAGACTCAT1GGTGATCC-3" (SEQJDJMO-80; antisense, 3' of Cop codon) 
and the human adult brain Mar»!hon-Ready cDNA (CJontrch) as template. Advantage cDNA 
polymerase mix (Oontech) was used for the amplification in a 50ul reaction with 5% 
15 DMSO by the following cycle with step 2 to 4 repeated 35 times: 94% for 1 minute; 
94% for 10 seconds; 58% for 20 seconds 72% for 1 minute 30 seconds and 72% for S 



A 1.1 kb PCR fragment was isolated from 1% agarose gel and cloned into the 
pCRU-TOPO vector (Invitrogen) and completely sequenced using the ABI Big Dye 
20 Terrniantor Kit (P.E. Biosystcm). See. SEQJD.NO.-.35 for nucleic acid sequence and 
SEQ.IDJNO.:36 for deduced amino acid sequence. The sequence of RUP27 cDNA 
clone isolated from human brain was determined to match with five unordered segments 
of AOG27643, indicating thai the RUP27 cDNA is composed of 5 exons. 
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PREPARATION OF NON-EiHDOCENOUS, CONSTtTUTIVELY ACTIVATED GPCRS 

Those skilled in the ait are credited with the ability to select techniques fox 
mutation of a nucleic acid sequence. Presented below are approaches utilized to 
S create non-endogenous versions of several of die human GPCRs disclosed above. 
The mutations disclosed below are based upon an algorithmic approach whereby the 
16* amino acid (located in the IC3 region of the GPCR) from a conserved proline (or 
an endogenous, conservative substitution therefore) residue (located in the TM6 
region of the GPCR, near the TM6/IC3 interface) is mutated, preferably to an alanine, 
10 histidine, argrninc or lysine amino acid residue, roost preferably to a lysine amino acid 



Preparation of nort-e 



GPCR* using Transformer Site-Directed™ 
manufacturer instructions. Two 



may be accomplished o 
Kit (Oontech) accordi 



marker ohgonucleotide. For convenience, the codon mutation to be incorporated ii 
human GPCR is also noted, in standard form (Table D): 



Receptor ldenttfkT 


Codon Mat* tkra 


kRUPS 


V274it 


UUJP9 


T249X } 


bAUPIO 


Ri32K 


hRUPll 


M294K 


hRUP12 


F220jt 


hRUFlft 1 


73m. 
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hkUPJ7 


Y21S* 


*RUP18 


— — i&z 


kRUP19 


T219* 


■RUP20 


K249A 
K348H 
K24SR 


DRUP2] 


R240X 


hktiptt 


YmyL 


hRUP24 




worn 


trio* 






hRUP27 


T248* 



2. QuOcChaage™ Site-Directed™ Mutagenesis 

Preparation of non-endogenous human GPCRs can also be accomplished by 
5 using QuikChange™ Site-Directed™ Mutagenesis Kit (Strstagene, according to 
manufacturer's instructions). Endogenous GPCR is preferably used as a template and 
two mutagenesis primers utilized, as well as, most preferably, a lysine mutagenesis 
oligonucleotide and a selection marker oligonucleotide (included in kit). For 
convenience, the codon mutation incorporated into the novel human GPCR and the 
10 respective oligonucleotides are noted, in standard form (Table E): 





Codoa 
MoUtioo 


(SEQJDJKX) nratanoo 
oaderthMd 


(MtbcBK) (SEQJDLNO.) 


Cycle Coadttioct* 
Mh,(«XSecf-) 
Cycles 2-4 
npcatctl 16 dmes 


1*1*13 


A26SK 


gc^Agg^aagcaa 


CCAGGAQAAOCACCT 


98* far 2' 

WfarSO" 

56«Cfcc3flr 

rrfar\i'4cr 

72* for 5' 




TTGCTTT00CTO0CC 
92) 


hRUPM 


L246k 


CAGtWKJOAACSKi 
CAOCATCATCATC (85) 


OATCAT^TCCTtrtf 
CTTTGCCTTCCTC (tS) 


98* for 2' 
98* for 30- 
WCfcrMT 
72* far If 40" 
72* far 5' 



. WO 01/36471 PCT/USOO/31509 



. hRUPli 


A398X. 


CCAGTOCAAAGCTAAG 
AAAOTGATCTTC (89) 


1 GAAGATCACI I'l CITA 
OCTTTOCACTGG(90) 


98* for 2* 
98* far 30" 
55*Cfor30" 
72-fcrll'40- 
72- Tor 5' 




W275li 


"ooSQccaCSjCSCCaa" 

GAGGAAGATTGGC (93) 


GOCAAlCUCCTClitt 

GOGOGGTOGCGGC 

(M) 


9rcorr 
°8*fpr3<r 

Se*Cfar30- 
72*fo» |1*40- 



The non-endogenous human GPCRs were then sequenced and the derived and 
. verified nucleic acid and amino acid sequences are listed in the accompanying 
5 "Sequence Listing" appendix to this patent document, as summarized in Table F 
below: 

TABLE F 



Nou l^oteootafciunno ™ 
GPCR 




AoiooAcUScqmac. 
Lbttai 


hRlfflS 






UtUrM 


5£QJlJ.NO.:B7 — 


SE0JT>.NO.:8fl 










SEOJD.KO.43 





Example 3 

10 RECEPTOR EXPRESSION 

Although a variety of cells are available to the art for the expression of 
proteins, it is most preferred that mammalian cells be utilized. The primary reason for 
this is predicated upon practicalities, L*.. utilization of, eg., yeast cells for the 
1 5 expression of a GPCR, while possible, introduces into the protocol a non- mammalian 
cell which may not (indeed, in the case of yeast, does not) include the receptor- 
coupling, genetic- mechanism and secretary pathways that have evolved for 
mammalian systems - thus, results obtained in non-mammalian cells, while of 

41 
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potential use, are not as preferred as that obtained from mammalian cells. Of the 
mammalian cells, COS-7, 293 and 293T cells are particularly preferred, although the 
specific mammalian cell utilized can be predicated upon the particular needs of the 

S a. Transient Translation 

On day one, 6x10*/ 10 cm dish of 293 cells well were plated out. On day two, 
two reaction tubes were pr ep a red {the pr oportions to follow for each tube are per plate): 
tube A was prepared by mixing 4pg DNA (e.g, pCMV vector; pCMV vector with 
receptor cDNA, etc) in 0.5 ml serum free DMEM (Giboo BRL); tube B was prepared by 

10 mixing 24ul lipofectarnine (Gibco BRL) in 0.5ml serum free DMEM. Tubes A and B 
were admixed by inversions (several times), followed by incubation at room temperature 
for 30~45tnm The admixture is referred to as the "Transfection mixture''. Plated 293 
cells were washed with 1XPK5, followed by addition of 5 nil serum free DMEM. 1ml 
of the transfection mixture were added to the cdb, followed by incubation for 4hrs at 

15 37"C/5% CO,. The transfection mixture was removed by aspiration, followed by the 
addition of 10ml of DMEM/10% Fetal Bovine Serum. Cells were incubated at 37*C/5% 
COi- After 48hr incubation, cells were harvested and utilized for analysis, 
b. Stable CeTJ Lines: Gs Fusion Protein 

Approximately 12x10* 293 cells are plated on a 15cm tissue culture plate. 

20 Grown in DME High Glucose Medium containing ten percent fetal bovine serum and 
one percent sodium pyruvate, L-glutaminc, and ann-biott.es. Twenty-four hours 
following plating of 293 cells to -80% confruency, the cells are transfected using 12ug 
of DNA. The 12ug of DNA is combined with 60ul of lipofectarnine and 2mL of DME 
High Glucose Medium without serum. The medium is aspirated from the plates and the 

25 cells are washed once with medium without serum. The DNA, lipofectarnine and 

42 
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; b added to the plate along with lOmL of medium without i 
Following inoibation it 37 degree* Celsius for four to five boms, the medium is 
aspirated end 25ml of medium containing seram is added. Twenty- four hours following 
transection, the medium is aspirated again, and fresh medium with scrum is added. 
5 Forty-eight hours following transfection, the medium is aspirated and medium with 
serum is added containing geneticin (G418 drug) at a final concentration of SOOpg/rnL. 
The trausfected celh now undergo selection for positively transfected cells containing 
the G418 resistant gene. The medium is replaced every four to five days as selection 
occurs. During selection, ceDs are grown to create stable pools, or split for stable clonal 
10 selection. 



ASSAYS FOR DETERMINATION OF CONSTITUTIVE ACTIVITY 



15 A variety of approaches are available for assessment of constitutive activity of 

the non-endogenous human GPCRs. The following are Illustrative; those of ordinary 
skill in die art are credited with the ability to determine those techniques thai are 
preferentially beneficial for the needs of the artisan. 

1. Membrane Binding Assays: l^SJGTPyS Assay 

20 

When a G protein-coupled re ce ptor is in its active state, either as a result of 
ligand binding or constitutive activation, the receptee couples to a G proton and 
stimulates die release of GDP and subsequent binding of GTP to the G protein. The 
alpha subunit of die G protein-receptor complex acts as a GTPase and slowly hydro lyzes 
25 the GTP to GDP, at which point the receptor normally is deactivated. Constitutive^/ 
activated receptors continue to exchange GDP for GTP. The non-hydrolyzable GTP 
analog. [^JGTPyS, can be utilized to demonstrate enhanced binding of ["SJGTPyS to 
membranes expressing constitutrvery activated receptors. The advantage of using 
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( 3% S)GTPyS binding to measure constitutive activation is that: (a) it is gencrical In- 
applicable to all G proteto-coupled receptors; (b) it is proximal at the membrane surface 
making it less likely to pick-up molecules which affect the intracellular cascade. 

The assay utilizes the ability of G protein coupled re c eptors to stimulate 
[ js S)GTPyS binding to membranes expressing die relevant receptors. The assay can, 
therefore, be used in the direct identification method to screen candidate compounds to 
known, orphan and oonstitutivery activated G protein-coupled receptors. The assay is 
' generic and has application to drug discovery at all G protein-coupled r e cept ors. 

The ["SJGTPyS assay was incubated in 20 mM HEPES and between 1 and 
about 20mM MgCb. (this amount can be adjusted for optimization of results, although 
20mM is preferred) pH 7.4, binding buffer with between about 03 and about 12 nM 
["SjGTPyS (this amount can be adjusted for optimization of results, although 1-2 is • 
preferred ) and 125 to 75 pg membrane protein (eg, 293 cells expressing the Gs Fusion 
Protein; tins amount can be adjusted for optimization) and 10 pM GDP (this amount can 
be changed for optimization) for 1 hour. Wheatgerm agglutinin beads (25 pi; 
Amersham) were then added and the nurture incubated for another 30 minutes at room 
te mp er a t u re. The tubes were then centrifugal at 1500 x g for 5 minutes at room 



2. Adeuylyl Cyclase 
A Flash Plate™ Aderrylyl Cyclase kit (New England Nuclear, Cat No. 
SMP004A) designed for cell-based assays can be modified for use with crude plasma 
membranes. The Flash Plate wells can contain a scmtQlant coating which also contains a 
specific antibody recognizing cAMP. The cAMP generated in the wells can be 
quantitated by a direct competition for binding of radioactive cAMP tracer to the cAMP 
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antibody. The following serves as a brief protocol for the n 
cAMP levels in whole cells thai express the receptors. 

Transfected cells were harvested approximately twenty four hours after transient 
transfection. Media is carefully aspirated off and discarded. 10ml of PBS is gentry 
S added to each dish of cells followed by careful aspiration. 1ml of Sigma cell 
dissociation buffer and 3ml of PBS are added to each plate. Cells were pipeted off the 
plate and the cell suspension was collected into a 50ml conical centrifuge tube. Cells 
were then cento fugrd at room te m pera ture at 1,100 rpm for 5 min. The cell pellet was 
carefully re-suspended into an appropriate volume of PBS (about 3mUplate). The cells 

10 were then counted using a hemocytometer and additional PBS was added to give the 
appropriate number of cells (with a final volume of about 50 pi/well). 

cAMP standards and Detection Buffer (comprising 1 uCS of tracer cAMP 
(50 pi] to 1 1 ml Detection Buffer) was prepared and maintained in anrordance with the 
manufacturer's instructions. Assay Buffer was prepared fresh for screening and 

15 contained 50ul of Stimulation Buffer, 3ul of test compound (12uM final assay 
concentration) and 50pl cells, Assay Buffer was stored on toe until utilized. The assay 
was initiated by addition of 50ul of cAMP standards to appropriate wells followed by 
addition of 50ul of PBSA to wells H-ll and HI2. 50pl of Stimulation Buffer was added 
to all wella. DMSO (or selected candidate compounds) was added to appropriate wells 

20 using a pin tool capable of dispensing 3 pi of compound solution, with a final assay 
concentration of 12pM test compound and lOOul total assay volume. The cells were 
then added to the wells and incubated for 60 min at room temperature. lOOuJ of 
Detection Mix containing tracer cAMP was then added to the wells. Plates were then 
incubated additional 2 hours followed by counting in a Wallac MicraBcta scintillation 



counter. Values of cAMPAvell were then extrapolated from a standard cAMP curve 
which was contained within each assay plate. 

3. Cell-Based cAMP for Gi Coupled Target GPCRs 

5 TSHR is a Gs coupled GPCR that causes the accumulation of cAMP upon 

activation. TSHR will be ccmstitutiveh/ activated by mutating amino acid residue 623 
(i.e., changing an alanine residue to an isoleucme residue). A Gi coupled receptor is 
expected to inhibit adenylyl cyclase, and, therefore, decrease the level of cAMP 
production, which can make assessment of cAMP levels challenging. An effective 
10 technique for measuring the decrease in production of cAMP as an indication of 
constitutive activation of a Gi coupled receptor can be acco mplishe d by co- trans fee ting, 
most preferably, non-endogenous, constitutive^ activated TSHR CTSHR-A623I) (or an 
endogenous, constitutive^ active Gs coupled receptor) as a "signal enhancer" with a Gi 
linked target GPCR to establish a baseline level of cAMP. Upon creating a non- 
15 endogenous version of the Gi coupled receptor, this non-endogenous version of the 
target GPCR is then co- transfected with the signal enhancer, and it is this material that 
can be used for screening. We will utilize such approach to effectively generate a signal 
when a cAMP assay is used; this approach is preferably used in the direct identification 
of candidate compounds against Gi coupled receptors. It is noted that for a Gi coupled 
20 GPCR, when this approach is used, an inverse agonist of the target GPCR will increase 
the cAMP signal and an agonist will decrease the cAMP signal 

On day one, 2X10* 293 and 293 cells/well will be plated out On day two, two 
reaction tubes win be prep ar ed (the prop or tions to follow for each tube are per plate): 
tube A win be prepared by mixing 2ug DNA of each receptor transfected into the 
25 mammalian celh, for a total of 4ug DNA (eg., pCMV vector, pCMV vector with 
mutated THSR (TSHR-A623I); TSHR-A623I and GPCR, etc) in 1.2ml serum free 



« prepared b 



DM EM (I/vine Scientific, Irvine, CA); tube B wfll be prepTTCU by mixing 120pJ 
Hpofectamine (Gibco BRL) in 1.2m) serum free DMEM Tubes A and B will then be 
admixed by inversions ( sever*] tuna), fallowed by incubation al room temperature for 
30-45min. The admixture is referred to as the "tiansfectron mixture**. Plated 293 cells 
win be washed with 1XPBS, followed by addition of 10ml serum free DMEM. 2.4ml of 
the tnnsfectioo mixture will then be added to the celts, followed by incubation for 4hrs 
at 37°Ctt%.CO}. The transfection mixture win then be removed by aspiration, followed 
by the addition of 23ml of DMEM/ 10% Fetal Bovine Serum. Cells will then be 
incubated at 37*05% COj. After 24hr incubation, cells will then be harvested and 
utilized Car analysis. 

A Flash Plate™ Adenyryl Cyclase kit (New England Nuclear, Cat No. 
SMP004A) is designed for cell-based assays, however, can be modified for use with 
crude plasma membranes depending on the need of the ski Bed artisan. The Flash Plate 



recognizing cAMP. The cAMP generated in the wdb can be quanlitated by a direct 
competition for binding of radioactive cAMP tracer to the cAMP antibody. The 
following serves as a brief protocol for the measurement of changes in cAMP levels in 
whole cells that express the receptors. 

Transfected cells will be harvested approximately twenty four hours after 
transient transection. Media win be carefully aspirated off and discarded. 10ml of PBS 
will be gently added to each dish of cells followed by careful aspiration, lml of Sigma 
cell dissociation buffer and 3m] of PBS will be added to each plate. Cells will be pipetcd 
off the plate and the cell suspension will be collected into a 50ml corneal centrifuge tube. 
Ceus win then be centrifuged at room te mp e ra ture at 1,100 rpm for 5 mm. The cell 
pellet will be carefully re-snspended into an appropriate volume of PBS (about 
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3 ml/plate). The cells will then be counted using a bernocytoroetex and additional PBS is 
added to give the ap pro pri ate number of cells (with a final volume of about 
50u*/weIJ). 

cAMP standards and Detection Buffer (comprising 1 uQ of tracer [ 125 I cAMP 
5 (50 pi} to 11 ml Detection Buffer) will be prepared and maintained in accordance with 
the manufacturer's instructions. Assay Buffer should be p rep ar ed fresh for screening 
and contained 50ul of Stimulation Buffer, 3ul of test compound (12uM final assay 
concentration) and 50ul cells. Assay Buffer can be stored on ice until utilized. The assay 
can be initiated by addition of 50ul of cAMP standards to appropriate weOs followed by 

10 addition of 50ul of PBSA to wcUs H- 11 and H12. 50ul of Strrnulation Buffer win be 
added to all wells. Selected compounds (eg.. TSH) win be added to appropriate wells 
using a pin tool capable of dispensing 3ul of compound solution, with a final assay 
concentration of 12pM test compound and lOOul total assay volume. The cells win then 
be added to the weDs and incubated for 60 mm at room temperature. lOOul of Detection 

15 Mix comaining tracer cAMP win then be added to the wells. Plates were then incubated 
additional 2 hours followed by counting in a Wallac MkroBeta yjntinfftwm counter. 
Values of cAMP/weU will then be extrapolated from a standard cAMP curve which is 
contained within each assay plate. 

4. Reporter-Based Assays 

20 a. CRE-Luc Reporter Assay (Gs-assodated reeeptori) 

293 and 293T cells are plated -out on 96 well plates at a density of 2 X 10* cells 
per well and were transfected using Upofectanune Reagent (BRL) the following day 
according to manufacturer instructions. A DNA/lipid mixture is prepared for each 6- 
well tnmsfectjon as follows: 260ng of plasnrid DNA in lOOpl of DMEM were gently 

25 mixed with 2 pi of lipid in 1 OOp] of DMEM (the 260ng of plasmid DNA consisted of 
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200ng of a 8xCRE-Luc reporter plasmid, 50ng of pCMV comprising endogenous 
receptor or non-endogenous receptor or pCMV alone, and lOng of a GPRS expression 
plasmid (GPRS in pcDNAJ (Invitrogen)). The 8XCRE-Luc reporter plasmid was 
p rep are d as follows: vector SRIF-fJ-gal was obtained by cloning the rat somatostatin 
5 promoter (-71/+51) at BglV-Hindlll site in the p£ gal-Basic Vector (Ciontech). Eight 
(8) copies of cAMP response element were obtained by PCR from an adenovirus 
template AdpCF126CCRE8 (see, 7 Human Gene Therapy 1883 (1996)) and cloned 
into the SRTF-p-gal vector at the Kpn-BglV site, resulting in the 8xCRE-fi-gal 
reporter vector. The 8xCRE-Luc reporter plasmid was generated by replacing the 

10 bcta-galact os i d ase gene in the 8xCRE-0-ga! reporter vector with the tuciferase gene 
obtained from the pGL3 -basic vector (Pro mega) at the HindlH-BamHI site. 
Following 30 min. incubation at room temperature, the DNA/lipid mixture was 
diluted with 400 pi of DMEM and lOOul of the diluted mixture was added to each 
well. 100 pi of DMEM with 10% PCS were added to each well after a 4hr incubation 

15 m a ceU culture incubator. The following day the transfected cells were changed with 
200 ul/well of DMEM with 10% FCS. Eight (8) hours later, the wells were changed 
to 100 pi /well of DMEM without phenol red, after one wash with PBS. Luciferase 
activity were measured the next day using the LucLrte™ reporter gene assay kit 
(Packard) following manufacturer instructions and read on a 1450 MicroBeta™ 

20 scintillation and luminescence counter (Wallac). 

b. API reporter assay (Gq-aosociatcd receptors) 
A method to detect Gq stimulation depends on the known property of Gq- 
dependent phosphoUpase C to cause the activation of genes containing API elements 

25 in their promoter. A Pathdetect™ AP-1 cis-Reporting System (Stratagene, Catalogue 
# 219073) can be utilized following the protocol set forth above with respect to the 

49 
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CREB reporter assay, except that the components of toe calcium phosphate precipitate 
were 410 ng pAPl-Luc, 80 ng pCMV-receptor expression plasmid, and 20 ng CMV- 
SEAP. 

e. Srf-Luc Reporter Assay (Gq- associated receptors) 

5 One method to detect Gq stimulation depends on the known property of Gq- 

dependent phospholipase C to cause the activation of genes containing serum 
response factors in their promoter. A Pathdetect™ SRF-Loc-Reporting System 
(Stratagene) can be utilized to assay for Gq coupled activity in, eg., COS7 cells. 
Cells are transfected with the plasmid components of the system and the indicated 

10 expression plasmid encoding endogenous or non-endogenous GPCR using a 
Mammalian Transection™ Kit (Stratagene, Catalogue #200285) according to the 
manufacturer's instructions. Briefly, 410 ng SRF-Luc, 80 ng pCMV-receptof 
expression plasmid and 20 ng CMV-SEAP (secreted alkaline phosphatase expression 
plasmid; alkaline phosphatase activity is measured in the media of transfected cells to 

15 control for variations in n-ansfection efficiency between samples) are combined in a 
calcium phosphate precipitate as per the rnanufactnrer's rnstructiona. Half of the 
precipitate is equally distributed over 3 wells in a 96-we!l plate, kept on the cells in a 
serum free media for 24 hours. The last 5 hours the cells are incubated with luM 
Angiotensin, where indicated. Cells are then lysed and assayed for hxaferase activity 

20 using a Luclite™ Kit (Packard, Cat U 601 691 1) and Trihrx 1450 Microbeta** liquid 
scintillation and luminescence counter (Wallac) as per the manufacturer's 
instructions. The data can be analyzed using GraphPad Prism™ 2.0a (GraphPad 
Software Inc.). 

25 

50 
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On day 1. cells comprising the receptors (endogenous and/or non-endogenous) 
5 can be plated onto 24 weD plates, usually 1x10 s cells/well (although his umber can be 
optimized. On day 2 cells can be transfected by firstly mixing C25pg DNA in SO pi 
serum free DMEMAvcU and 2 pi Hpofectamirte in 50 pi senrmfiee DMEM/welL The 
solutions ere gently mixed and incubated Tor 15-30 rain at room temperature. Cells are 
washed with OS ml PBS and 400 ul of serum free media is mixed with the transection 

10 media and added to the cells. The cells are then incubated for 3-4 his at 37*C/5%COj 
and then the transfeetion media is removed and replaced with 1 ml/well of regular growth 
media. On day 3 (be cells are labeled with ^-myoinositol. Briefly, the media is 
removed and the cells are washed with 0.5 ml PBS. Then 03 ml inoritol-free/serum free 
media (GIB CO BRL) is added/well with 0.25 pG of ^-myoinositol/ well and the cells 

15 are incubated for 16-18 hrs cVn at 37~C/5%COj . On Day 4 the cells are washed with 0.5 
ml PBS and 0.45 ml of assay medium is added containing inositol- frec/serurn free media 
10 pM pargymoe 10 mM lithium chloride or 0.4 ml of assay medium and 50pl of lOx 
ketanserin (kef) to final concentration of lOpM. The cells are then incubated for 30 nun 
at 37*C The cells are then washed with 0J ml PBSand 200pl of fresMcecold stop 

20 solution (1M KOH; 18 raM Na-borate; 3.8 mM EDTA) is added/well. The solution is 
kept on ice for 5-10 nun or until cells were lysed and then neutralized by 200 ul of 
fresh/ice cold neutralization soL (7.5 % HCL). The hysate is then transferred into 1_5 ml 
eppendorf tubes and 1 ml of chJcrofonu/raethanol (12) is added/ rub e. The solution is 
vortex ed for 15 sec and the upper phase is applied to a Biorad AG1-X8™ anion 

25 exchange resin (100-200 mesh). Firstly, the resin is washed with water at 1:1.25 W/V 
and 0.9ml of upperpbase is loaded onto the column. The column is washed with lOmls 
of 5 mM myoinositol and 10 ml of 5 mM Na-borate/60mM Na-forrnate. The inositol 



tris phosphates are eruted into scintillation vials containing 10 ml of scintillation cocktail 
with 2 ml of 0.1 M formic acid/ 1 M arnmonium formate The columns are regenerated 
by washing with 10 ml of 0.1 M formic acid/3M ammonium formate and rinsed twice 
with dd HjO and stored at 4"C m water. 
5 Exemplary results are presented below in Table G: 
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Exemplary results of GTPyS assay for detecting constitutive activation, as 
disclosed in Example 4(1) above, was accomplished utilizing GsJusion Protein 
Constructs on human RUP13 and RUP15. Table H below lists the signals generated 
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FUSION PROTEIN PREPARATION 

», GPCRtGs Fusion Constnet 

The design of the ccmstitmively activated GPCR-G protein fi 
accomplished as follows: both the 5' and 3* ends of the rat G protein Gsa (long form; 
Itoh, H. et d, 83 3776 (1986)) were engineered to include » Hindm (5«- 

AAGCTT-3') sequence thereon. Following confmrurhon of the comet setmence 
fincluding the fumking Hindm seouencesX the entire sequence 
pcDNA3.1(.) (Invitmgen. cat no. V795-20) by subcloning using the rfindTfJ r 
site of that vector. The correct orientation for the Gsa sequence was d 
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subcloning into pcDNA3.1(-)- The modified pcDNA3.1(-) containing the rat Gsa gene 
I sequence was then verified; this vector was now available as a "universar 
tor. The pcDNA3.1(-) vector contains a variety of well-known 
restriction sites upstream of the Hindm site, thus beneficially providing the ability to 
5 insert, upstream of the Gs protein, the coding sequence of an endogenous, constitutively 
active GPCR. This same approach cim be utilized to create other ^Iniversa^ G protein 
vectors, and, of course, other commercially available or proprietary vectors known to the 
artisan can be utilized - the important criteria is that the sequence for the GPCR be 
upstream and hvfiame with that of the G protein. 
10 RUP13 couples via Gs. For the following exemplary GPCR Fusion Proteins, 

fusion to Gsa was accomplished. 

A RUPI3-Gsa Fusion Protein construct was made as follows: primers were 
designed as follows: 

5 ^gatcfTCTAGAATJGGAGTCCTCAC<X^TCCC^CAG -3* (SEQJDJ«>^97; sense) 
15 5^gatetGATATC]OGTOACTCCAGCOGGGGTQAOGCGGC>3* (SEQJIxWO^I; ardisense). 



(designated m brackets) between me G protein and RUP13. The sense i 
primmrnchjdedtberestrk respectively . such that spacers 
(attributed to the lestrictien sites) exists between the G protein and RUP15. 

PGR was then utilized to secure the respective receptor sequences for fusion 
within the Gsa universal vector disclosed above, using the following protocol for each: 
lOOng cDNA for RUP15 was added to separata tubes cxartairting 2ul of each primer 
(sense and arrti-aertse), 3uL of lOmM dNTPa, lOuL of lOXTaqPlus^ Precision buiTer, 
lul, of TaqPlus™ Precision porymerase (Stratagene: #60021 IX and 80pi- of water. 
Reaction temperatures and cyde tones for RUP15 were as follows with cycle steps 2 
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through 4 were repeated 35 times: 94 "C for 1 min, 94 °C 



seconds; 62°C for 20 

sec; 72*C 1 min 40sec; and 72° C 5 min . PCR product far was run on a 1% agarose 
gel and then purified (data not shown). The puriSed product was digested with Xbal and 
EcoRV and the desired inserts purified and li gated into the Gs universal vector at the 
resp e cti ve restriction she. The positive clones was isolated following transformation and 
determined by restriction enzyme digest; expression using 293 cells was accomplished 
following the protocol set forth infra. Each positive clone for RUP1 5-Gs Fusion Protein 
was sequenced to verify correctness. (See, SEQJD-NO.;99 for nucleic acid sequence 
and SEQ.ID.NO.: 100 for amino acid sequence \ 

RUP15 couples via Gs. For the following exemplary GPCR Fusion Proteins, 

A RUP15-Gsa Fusion Protein construct was made as follows: primers were 



5-TCTAGAATX^amX^CCrcCACCAACAGC-3' (SEQ.tD NO.: 101 ; sense) 
1 5 5 --gatatoGCAGGAAAAGTAGCAG AATCGTAGG AAG-3' (SEQIDJ40.: 1 02; ■ 

Nucleotides in lower caps are included as spacers in the restriction shea between 
the G protein end RUP15. The sense and anti-sense primers included the restriction sites 
for EcoRV and Xbal, respectively, such that spacers (attributed to the restriction sites) 
exists between fee G protein and RUP15. 
20 PCR was men utilized to secure the respective receptor sequences for fusion 

within the Gsa universal vector disclosed above; using the following protocol for each: 
lOOng cDNA for RUP15 was added to separate tubes containing 2 pi of each primer 
(sense and anti-sense), 3uL of lOmM dNTPs, lOpL of lOXTaqPlus 7 * Precision buffer, 
luL of TaqPlus™ Precision polymerase (Stratagenc: #60021 IX and 80uL of water. 
25 Reaction temperatures and cycle times for RUP15 were as follows with cycle steps 2 
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through 4 were repeated 35 times: 94 "C for 1 min; 94 "C for 30 seconds; 62*C for 20 
sec; 72 "C 1 min 40sec; and 72* C 5 min . PCR product for was run on a 1% agarose 
gel and then purified (data not shown). The purified product was digested ). The 
purified product was digested with EcoRV and Xbal and the desired inserts purified and 
li gated into the Gs universal vector at the respective restriction site, The positive clones 
was isolated following transformation and determined by restriction enzyme digest; 
expression using 293 cells was accomplished following the protocol set forth infra. 
Each positive clone for RUP 1 5-Gs Fusion Protein was sequenced to verify correctness. 
(Sec, SEQJD.NO.: 103 for nucleic acid sequence and SEQJD.NO.: 1 04 for amino acid 



b. Gq(6 amino acid ddetionVGi Fusion Construct 

The design of a Gq (dciyGi fusion construct can be a 
the N-texminal six (6) ammo acids (amino acids 2 through 7. having the sequence of 
TLESIM (SEQ.ID.NO.: 129) Goq-subunit win be deleted and the G-terminal five (5) 
amino acids, having the sequence EYNLV (SEQ.ID.NOj 130) will be replace with the 
corresponding amino acids of the Geo Proton, having the sequence DCGLF 
(SEQ.ID.NO.:131). This fusion construct will be obtained by PCR using the following 



5 '-^tc««cttcCATGGCGTGCTGCCTGAGCGAGGAG-3 * (SEQJDNO.: 1 32) and 

5'^tcggatr«TTAGAAGAGGCCGCAGTCCT^ • 
(SEQJDJJO.:133) 

and Plasxnid 63313 which contains the mouse Gaq-wild type version w 



TaqPlus Precision DNA polymerase (Stratagenc) will be utilized for the 
amplification by foe following cycles, with steps 2 through 4 repeated 35 times: 95°C 
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for 2 min; 95°C for 20 sec; S6"C for 20 sec; 72*C for 2 min; and 72°C for 7 min. The 
PCR product will be cloned into a pCRH-TOPO vector (Invitrogen) and sequenced 
using the ABI Big Dye Terminator kit (P.E. Biosystem). Inserts from a TOPO clone 
containing the sequence of the fusion construct will be shuttled into the expression 
vector pcDNA3.1(+) at the Hmdffl/BamHI site by a 2 step cloning process. 
Example 6 

Tissue Distribution of the disclosed human GPCRs: RT-PCR 

RT-PCR was applied to confirm the expression and to determine the tissue 
distribution of several novel human GPCRs. Oligonucleotides utilized were GPCR- 
specific and the human multiple tissue cDNA panels (MTC, Clontech) as templates. 
Taq DNA polymerase (Stratagene) were utilized for the amplification in a 40ul 
reaction according to the manufacturer's instructions. 20pl of the reaction will be 
loaded on a I .$% agarose gel to analyze the RT-PCR products. Table J below lists the 
receptors, the cycle conditions and the primers utizilized. 
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hKU>12 


94' to V 


OCAGTGATCJ 


1 CAGACACTT I 


490bp 
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hRUT13 
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Brain 
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Example 7 

Protocol: Direct Identification of Inverse Agonists mod Agonists 
A- t^SJGTPyS Assay 

5 Although we have utilized endogenous, cocstitutively active GPCRs for the 

direct identification of candidate compounds as, e.g., inverse agonists, for reasons that 
are not altogether understood, infra-assay variation can become exacerbated. 
Preferably, then; a GPCR Fusion Protein, as disclosed above, is also utilized with a non- 
endogenous, constitatrvery activated GPCR. We have determined that when such a 

10 protein is used, intra-assay variation appears to be substantially stabilized, whereby an 
effective rignal-to-noise ratio is obtained. This has the beneficial result of allowing for a 
■ more robust identification of candidate compounds. Thus, it is preferred that for direct 
identification, a GPCR Fusion Protein be used and that when utilized, the following 
assay protocols be utilized. 

]5 1. Membrane Preparation 

Membranes comprising the constitufiveh/ active orphan GPCR Fusion Protein of 
interest and for use in the direct identification of candidate compounds as inverse 
agonists, agonists or partial agonists are preferably prepared as follows: 

a. Materials 

20 "Membrane Scrape Buffer** is comprised of 20mM HEPES and lOmM EDTA, 

pH 7.4; "Membrane Wash Buffer** is comprised of 20 mM HEPES and 0.1 mM 
EDTA, pH 7.4; •'Binding Buffer** is comprised of 20mM HEPES, 100 mM NaCl, and 
10 mM MgCU, pH 7.4 

b. Procedure 

25 All materials will be kept on ice throughout die procedure. Firstly, the media 

will be aspirated from a confluent monolayer of cells, followed by rinse with 10ml cold 

M 
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PBS, followed by aspiration. Thereafter, 5 ml of Membrane Scrape Buffer will be added 
to scrape cells; this will be followed by transfer of cellular extract into 50ml centrifuge 
tubes (centrifuged at 20,000 rpm for 17 minutes at 4*Q. Thereafter, the supernatant will 
be aspirated and the pellet will be resuspended in 30ml Membrane Wash Buffer 
5 followed by centrifuge at 20,000 rpm for 17 minutes at 4"C The supernatant win then 
be aspirated and the pellet resuspended in Binding Buffer. This will then be 
honwgenized using a Brinkman porytron™ bomogenizcr (15-20 second bursts until the 
' all material is in suspension)- This is referred to herein as "Membrane Protein**. 
2. Bradford Protein Assay 

10 Following the homogenizaticm, protein concentration of the membranes will 

be determined using the Bradford Protein Assay (protein can be diluted to about 
1.5mg/ml, aliquoted and frozen (-80*C) for later use; when frozen, protocol for use 
will be as follows: on the day of the assay, frozen Membrane Protein is thawed at 
room temperature, followed by vortex and then homogenized with a polytron at about 

15 12 x 1,000 rpm for about 5-10 seconds; it was noted that for multiple preparations, the 
homogenizor should be thoroughly cleaned between hornoginczarion of different 
preparations). 

a. Materials 

Binding Buffer (as per above); Bradford Dye Reagent; Bradford Protein 
20 Standard will be utilized, following manufacturer instructions (Biorad, cat no. 500- 
0006). 

b. Procedure 

Duplicate tubes will be prepared, one including the membrane, and one as a 
control "blank**. Each contained SOOul Binding Buffer. Thereafter, lOul of Bradford 
25 Protein Standard (Img/tn!) will be added to each tube, and 1 Opl of membrane Protein 
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will then be added to just one tube (not the blank). Thereafter, 200ul of Bradford Dye 
Reagent will be added to each tube, followed by vortex of each. After five (5) 
minutes, the tubes will be re-vortexed and the material therein will be transferred to 
cuvettes. The cuvettes will then be read using a CECIL 3041 spectrophotometer, at 
5 wavelength 595. 

3. Direct Identification Assay 

a. Materials 

GDP Buffer consisted of 37 5 ml Binding Buffer and 2mg GDP (Sigma, cat no. 
G-7127), followed by a series of dilutions in Binding Buffer to obtain 0.2 uM GDP 
10 (final concentration of GDP in each well was 0.1 uM GDP); each well comprising a 
candidate compound, has a final volume of 200ul consisting of lOOpI GDP Buffer (final 
concentration, O.IpM GDP), 50ul Membrane Protein in Binding Buffer, and 50pl 
[ 3S S)GTPyS (0.6 nM) in Binding Buffer (2.5 pi I 33 SJGTPyS per 10ml Binding Buffer). 

b. Procedure 

. 15 Candidate compounds will be preferably screened using a 96-wcil plate format 

(these can be frozen at -80*Q. Membrane Protein (or rnembranes with expression 
vector excluding the GPCR Fusion Protein, as control), will be homogenized briefly 
until in suspension. Protein conce nt r ati on will then be determined using the Bradford 
Protein Assay set forth above. Membrane Protein (and control) will then be diluted to 
20 0.25mg/ml in Binding Buffer (final assay concerttration. 12.5ug/well). Thereafter, 100 
ul GDP Buffer was added to each well of a Wallac Scintistrip™ (Wallac). A 5ul pin- 
tool will then be used to transfer 5 pi of a candidate compound into such well (Le^ Spl in 
total assay volume of 200 pi is a 1 :40 ratio such that the final screening concentration of 
the candidate compound is lOuM). Again, to avoid contannriation, after each transfer 
25 step the pin tool should be rinsed in three reservoirs comprising water (1 XX cthanol (IX) 



and water (2X) - excess liquid should be shaken from the too^Hweach rinse and dried 
with paper and kimwipes. Thereafter, 50 pi of Membrane Protein wiQ be added to each 
well (a control wdl com prising membranes without the GPCR Fusion Protein was also 
utilized), and pre- incubated for 5-10 minutes at room temperature. Thereafter, 50ul of 
5 [ ,5 S]GTPyS (0.6 nM) in Binding Buffer will be added to each well, followed by 
incubation on a shaker for 60 minutes at room temperature (again, in this example, plates 
were covered with foil). The assay will then be stopped by spinning of the plates al 4000 
RPM for IS minutes at 22*C The plates will then be aspirated with an 8 channel 
. manifold and sealed with plate covers. The plates will then be read on a Wallace 1450 
10 using setting "Prot #37" (as per manufacturer instructions). 
B. Cyclic AMP Assay 

Another assay approach to directly identified candidate compound was 
accomplished by utilizing a cyclase-based assay. In addition to direct identification, this 
IS assay approach can be utilized as an independent approach to provide confirmation of 
the results from the [ ,3 S]GTPyS approach as set forth above. 

A modified Flash Plate™ Adeoyiyl Cyclase kit (New England Nuclear, Cat No 
SMP004A) was preferably utilized for direct identification of candidate compounds as 
inverse agonists and agonists to constitutively activated orphan GPCRs in accordance 
20 with the following protocol. 

Transfected cells were harvested approximately three days after transfectjon. 
Membranes were p rep ar ed by bomogenization of snspendrd cells in buffer containing 
20mM HEPES, pH 7.4 and lOmM MgClj. Homogenization was performed on ice using 
a Brinkman Po!ytron™ for approximately 10 seconds. The resulting homogenate is 
25 centrifuged at 49,000 X g for 15 minutes at 4°C The resulting pellet was men 
resuspended in buffer containing 20mM HEPES, pH 7.4 and 0.1 mM EDTA, 
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homogenized for 10 seconds, followed by centrifugation at 49.000 X g for IS minutes at 
4°C. The resulting pellet was men stored at -80°C until utilized. On the day of direct 
identification screening, the membrane pellet as slowly thawed at room temperature, 
resuspended in buffer containing 20mM HEPES, pH 7.4 and lOmM MgCL2, to yield a 
5 final protein concentration of 0.60mgAnl (the resuspended membranes are placed on ice 
until use). 

cAMP standards and Detection Buffer (comprising 2 uCi of tracer [ IJ5 I cAMP 
(100 pf) to 11 ml Detection Buffer) were prepared and maintained in accordance with 
the manufacturer's instructions. Assay Buffer was prepared fresh for screening and 

10 contained 20mM HEPES, pH 7.4, lOmM MgCfc, 20mM phospocreatine (Sigma), 0.1 
units/ml creatine phosphokinase (Sigma), 50 uM GTP (Sigma), and 0.2 mM ATP 
(Sigma); Assay Buffer was then stored on fee until utilized. 

Candidate compounds identified as per above (if frozen, thawed at room 
temperature) were added, preferably, to 96- well plate wells (3ul/weU; 12uM final assay 

15 concentration), together with 40 pi Membrane Protein (30ug/well) and 50ul of Assay 
Buffer. This admixture was then incubated for 30 r 



Following the incubation, lOOpJ of Detection Buffer was added to each WeD, 
followed by incubation for 2-24 hours. Plates were then counted in a Wallac 
20 MicroBeta"™ plate reader using Trot #3 1" (as per manufacturer instructions). 

A representative screening assay plate (96 well format) result is presented in 
Figure 1Z Each bar r e p r esents the results for a different compound in each well, plus 
RUP13-Gsct Fusion Protein construct, as prepared in Example 5(a) above. The 
representative results presented in Figure 12 also provide standard deviations based upon 
25 the mean results of each plate ("m") and the mean plus two arbitrary preference for 
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selection of inverse agonists as "leads" from the primary screen involves selection of 
candidate compounds that that reduce the per cent response by at least the mean plate 
response, minus two standard deviations. Conversely, an arbitrary preference for 
selection of an agonists as "leads" from the primary screen involves selection of 
5 c a n di d ate compounds that increase the per cent response by at least the mean plate 
response, phis the two standard deviations. Based upon these selection processes, the 
ca n did at e compounds in the following wells were directly identified as putative inverse 
agonist (Compound A) and agonist (Compound B) to RUP13 in wells A2 and G9, 
respectively. See, Figure 12. It is noted for clarity: these compounds have been directly 

1 0 identified without any knowledge of the endogenous ligand for this GPCR. By focusing 
on assay techniques that are based upon receptor function, and not compound binding 
affinity, we are able to ascertain compounds that are able to reduce the functional 
activity of this receptor (Compound A) as well as increase the functional activity of the 
receptor (Compound B). Based upon the location of these r ece p t o r in lung tissue (see, 

15 for example, hRUP13 and hRUP21 in Example 6), pharmaceutical agents can be 
developed for potential therapeutic treatment of lung cancer. 

References cited throughout this patent document, including co-pending and 
related patent applications, unless otherwise indicated, are fully incorporated heron by 
reference. Modifications and extension of the disclosed inventions that are within the 

20 purview of the skilled artisan are encompassed within the above disclosure and the 
claims that follow. 

Although a variety of expression vectors are available to those in the art, for 
purposes of utilization for both the endogenous and non-endogenous human GPCRs, it is 
most preferred that the vector utilized be pCMV. This vector was deposited with the 
25 American Type Culture Collection (ATCC) on October 13, 199S (10801 University 
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Blvd., Manassas, VA 201 10-2209 USA) under the provisions of the Budapest Treaty for 
the International Recognition of the Deposit of Microorganisms for the Purpose of Patent 
Procedure. .The DNA was tested by the ATCC and determined to be viable. The ATCC 
has assigned the following deposit number to pCMV: ATCC #20335 1 . 
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CLAIMS 

What is claimed is: 

1. A G protein-coupled receptor encoded by an amino acid sequence of 
5 SEQ.ID.NO. :2. 

2. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 1. 

3. A ptasmid comprising a vector and the cDNA of SEQ.TDJNO.:]. 

4. A host cell comprising the plasmid of claim 3. 

10 5. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO. :4. 

receptor of claim 5. 
7. A plasmid comprising a vector and the cDNA of SEQJD.NO. 3. 
15 S. A host cell comprising the plasmid of claim 7. 

9. A G protein-coupled receptor encoded by an ainino add sequence of 
SEQJD.NO. :6. 

10. A non-endogenous, constitutively activated version of the G p 



11. Apbs 



g a vector and the cDNA of SEQJDJ'J0.:5. 



12. A host cell comprising the plasmid of claim I 

13. A G protein-coupled receptor encoded by en 
SEQJD.NO. :8. 

14. A non-endogenc 
25 receptor of claim 13. 
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15. A plasmid co mpris ing a vector and the cDNA of SEQ JDNO.:7. 

16. A host cell comprising the plasmid of claim 15. 

17. A G protein-coupled recep to r encoded by an amino acid se qu ence of 
SEQJD.N0.: 10. 

1 8. A non-endogenous, camstitotivery activated version of the G protein-coupled 
urn 17. 



19. A plasmid comprising a vector and the cDNA of SEQJD.NO.S. 

20. A host cell comprising the plasmid of claim 19. 

SEQJDJ>IO.:12. 

22. A non-endogenous, constitutively activated version of the G protei 



n 21 com p rising an amino acid sequence of SEQ.ID.NO 84. 

23. A plasmid comprising * vector and the cDNA of SEQ.ID.NO.: 1 1 . 

24. A boat cell comprising the plasmid of claim 23. 

25. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQID.NO.: 14. 

26. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 25 comprising an amino acid sequence of SEQJD.NO 88. 

27. A plasmid comprising a vector and the cDNA of SBQ JD.N0.:13. 

28. A host cell comprising the plasmid of claim 27. 

29. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQJDJsfO.:16. 

30. A non-endogenous, constitutively activated version of the G protem-coupkd 
receptor of claim 29 comprising an amino acid sequence of SEQJDJ^O.:92. 

31. A plasmid comprising a vector and toe cDNA of SEQ-ID-NO.:! 5. 



WO 01/36471 PCTAJSOO/31509 

32. A host cell comprising the plasmid of claim 31. 

33. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQJD:N0.:18. 

34. A non-endogenous, constitutivery activated version of the G protein-coupled 
5 receptor of claim 33. 

35. A plasmid comprising a vector and the cDNA of SEQ.ID.NO : 17, 

36. A host cell comprising the plasmid of claim 35. 

37. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQJD.NO.-.20. 

10 38. A Don-endogenous, constitutively activated version of the G protein-coupled 

receptor of claim 37. 

39. A plasmid comprising a vector and the cDNA of SEJD.NO.:l 9. 

40. A host cell comprising the plasmid of claim 39. 

41. A G protein-coupled receptor encoded by an amino acid sequence of 
15 SEQJD.NO.:22. 

42. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 4 1 . 

) 43. A plasmid comprising a vector and the cDNA of SEQ.EDNO. -3. 1 . 

44. A host cell comprising the plasmid of claim 43. 
20 45. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQJD.NO.:24. 

46. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 45. 

47. A plasmid co mprisin g a vector and the cDNA of SEQ.IDNO. :23. 
25 48. A host cell comprising the plasmid of claim 47. 
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49. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQJD.NO.i26. 

50. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 49. 

5 51. A plasmid comprising a vector and the cDNA of SEQ.ID.NO.l25. 

52. A host cell comprising the plasmid of claim 51 . 

53. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQJD.NO.:28. 

10 receptor of claim 53. 

55. A plasmid comprising a vector and the cDNA of SEQJD.NO. 27. 

56. A host cell comprising the plasmid of claim 55. 

57. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ID.NO. :30. 

15 58. A rxm-endogenous, constitutively activated version of the G protean-coupled 

receptor of claim 57. 
59. A plasmid comprising a vector and the cDNA of SEQJD.NO.:29. 
) 60. A host cell comprising the plasmid of claim 59. 

61. A G protein-coupled receptor encoded by an amino acid sequence of 
20 SEQ ID.NO. 32. 

62. A non-endogenous, constitutively activated version of die G protein-coupled 
receptor of claim 61 comprising an amino acid sequence of SEQ.ID.NO.^6. 

63. A plasmid comprising a vector and the cDNA of SEQ.IDNO.^5. 

64. A host cell comprising the plasmid of claim 63. 
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65. A G proton-coup le d receptor encoded by an amino a 
SEQJDJ40.J4. 

66. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 65. 

67. A plasmid comprising a vector and die cDNA of SEQ.IDJJO.i33. 

68. A host cell comprising the plasmid of claim 67. 

69. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQJD*JO.:36. 

70. A non-endogenous, ccnstituttvely activated version of the G protein-coupled 
receptor of claim 69. 



71 . A plasmid comprising • vector and the cDNA of SEQ.EDJJO.l3 5. 

72. A host cell comprising the plasmid of claim 71. 

73. A G protein-coupled receptor encoded by an amino acid sequence of 
SEQ.ED.NO.:38. 

74. A nan-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 73. 

75. A plasmid comprising a vector and the cDNA of SEQJD JJO. 31. 

76. A host cell comprising the plasmid of claim 75- ) ^ g ^ 

77. A G protein-coupled receptor encoded by an amino acid sequence of "* n *~ 
SEQJD.N0. :40. 

78. A non-endogenous, constitutively activated version of the G protein-coupled 
receptor of claim 77. 

79. A plasmid comprising a vector and the cDNA of SEQJDJMO.:39. 

80. A host cell comprising the plasmid of claim 79. 
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•tggeagccc 


agaatggaaa 


cmcc« 9 tttc 


acacccaact 




tctcctttaa 


cttcagttat 


ggtgatt.tg 


gaggacatga 
gc*ggo«tc» 


tgcr g q t ct 9 


gaccttcttc 
cggcatcggt 


gcagccaags 
aactttgtct 


titttgugt 


tgcgcaacct 


caccaatctg 


ctcattgcc* 


ctggtggcca 
tgggagcatg 


totctgctg 
gccacgtgct 


ccccttcgag 
ctgtgcctcc 


atggactaet 
gtcaactacc 


gtctccacea 


atgccttgct 


ggccattgcc 


attgacagat 




ggatgaatta 


tcaaacggcc 


tccttcctga 


tcc»ttctc» 


ttgccatccc 


atcggcttac 


tttgcaaeag 


•ag.gccagg 


agatgatctt 


ctgtggccag 


atctggcctg 


aa^tcctact 


tcctcttcat 


ctttggtgtc 


gagttcgtgg 


ctgtgctatg 


ccaggatctc 


ccggg.gctc 


tggttcaagg 


gagcagattc 
ctcacggect 


gc.agcggct 
•tgtgctgtg 


gcgctgccgc 
ctgggcaccc 


aggaagacgg 

ttcrtaeggtt 


ttccccactg 


tgttcgtga. 


ggaaaagcac 


tacctcactg 


atcgccatg* 


geaacagcat 


gatcaacaec 


gtgtgcttcg 


•tg»»gt«ct 


tc*ag*ag»t 


gatgctgctg 


cactggcgtc 


tccagtgctg 


accttgacct 


ciguccuc 


ggggtgcccs 


atcaggctga 


agtga 








<40C> 2 

Hat Ala Ala' Gin Asa 

1 5 

Pre Cln Asp Hla Ala 

20 

Tyr Asp L*u Pro Hat 

Pha Pha Ala Ala Lya 

SO 

Lao Val Cys Gly XI* 
65 

Tyx Lys Lys L«u Arg 



Gly Asn Thr Sar Pha Thr 



Glu Asp 
40 

Val II a 



Ila Sar Asp Pha L*u Val Ala II* lis 

105 

u Sar Trp 



100 



Asp Hat 
Ila Ala 

Pha II* 

75 

In Lao 

M 

Cys Cys I 
Glu His ( 



Pro Asn Pha 

Pb* Bar Tyr 
90 

Thr Lys Thr 
45 

L*u Ala Gly 
iO 

Ala Ala L*u 
II* Ala Asn 



r Val Sar l*u 



Laa Lys Pro Arg Hat Asn 



XI* Asp 
Tyr Gin 
Lau Ila 
XI* val 



Tyr Val Sar 
1*0 

Ala XI* Val 
Pha Leu XI* 



Ala Xla Pro 1 



L Asp C 

215 



I XI* 

i 

t Cln 



Glu Pb* 
Sar Arg 
XI* Arg 



Gin bra Tyr Tyr Lys Sar 

V*l Gly Pro Val Val Thr 1 
235 

Glu L*o Trp Ph* Lys Ala 1 



Asn Pro 
15 

Gly Asp 
Arg Thr 
XI* M*t 
Thr Arg 



Lau Cys 
Thr Asn 



Ph* Cys 
Tyr Pb* 



2 L r5 J 



' Pb* Thr II* Val Arg Asp I 



Ph* Vsl Lys Glo Lys HI* 



295 
Tyr L*u 



Thr Al* Pb* Tyr Vsl Val 



Arg Lys 
Cys Trp 
Thr Val 
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<211> 1260 
<212> DMA 
<213> Mocao s«pi*n» 



ctccocttty ccgg.gggta cctgccctct gattcccagg 
gctctcttgg tggctgtctg cctggtgggc ttcgtgggaa 
etccttcaca etgcttggaa aggaa*gcca tccatgatce 
agcctggctg atctctccct cctgctgttt tctyucctt 



gttscccctg 
cctcgtggat 
actcctggca 
ccaatgtaaa 
agtcacagtg 



tgaatgtgtc ctttgctcac 
actggagaac catcaccccg 
ecctgtgtgt gattggutc 



atctggactg tggctagcct 
catgaaggtg tggaaatgtg 
atgtttggta agetctacce 
tatttctgg* gagcttatga 
aaccagatac gctcaaagca 
ctcttgtggc tccccgaatg 
ccggccccac cacaaggttt 
gcaeatcctc tcatttttct 
togaaatgga tgataaceaa 
ggcaactcag agggtcttcc 
gaaaaagega aacceagctc 
! ctgacgtaga 



tccetcctct 
gcagttttgg 



octgtctcag 
ceatctecag 
ggeaaaggga 



<210> 4 

<211> 419 

<212> PUT 

<213> Homo sapiens 




agtctcagga aacaccagct 
astccccagc atccetacca 
aaactgag** ggcagagatt 
acacagtccc ttctgtacag 
caggggaagg rgtt*a*t*g 



1020 
1080 
11*0 
1200 
1260 
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S.r Pbe Ala Hi. 

20 

Gin Asp Trp Arg 

3* 

Val Gly Ph. Val 

50 

Ala Trp Lya Gly 
69 

Sar ls»u Ala Asp 



L.u Kis 

Thr II* 

Gly Asn 

Lya Pro 2 
10 

Lau S«r 1 

as 

r Lya Sar \ 

1 

• Ph. ria e 
t Ale Lya \ 



Ala Asp Sar Asn Sar Sar sar 

10 

Ph* Ala Gly Gly Tyr Law Pro 
25 

II* Pro Ala t 



55 



• Val II. Gly 

. II* His S*r 
75 

) m ri« sn 

90 

> Asp Lau Gly 
105 

r Cys Hat Ala 
i Ph. Met Tyr 



val Ala 
45 

II* L*U 



Hat. Asn val 
15 

Sar Asp S«r 

30 

Val Cys L*u 
Leu Bis Asn 
l Leu Asn L*u 

to 

> II* Arg Ala 
95 

» Val Cys Lys 
110 

x S*r Leu Thr 



II* Trp Thr Val i 



i Bis Gla Gly Val 
i Glu Phe Met s*r 



140 

> Sar val 1 

i 

> Glu Trp I 
i L*o Val f 



Met Ph. Gly ] 
Ph* Ala S*r I 



» Tyr Pro Lau 
: Ph* Trp Arg 



Ala Tyr Asp Gin C 

225 

Asn Gin II* Arg ! 



> Lya Ala 
» Val Leu 



Lys Arg 

Gin Val 

Trp Lau 

Ala Gly 
280 

Hat Ph* 

295 



: Leu Leu 2 
> Val Ala 1 



Pro Ala Pro 
S*r II* Scr 



Pro Gin Gly Ph* II* 



II* Ph* Lau Val I 

305 

Trp Lys Trp Hat I 
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<210> 5 

<211> 1014 

<212> DMA 

<213> Hobo aapi.n* 



gtggactgce tggatggcgc ctgcctggcc atcgacccgc tgcgcgtggc cecgctccc* 
etgtatgccg ecatcttcct ggtggggptg ecgggcaatg ccatggtggc etgggtggct 
gggaaggtgg cccgceggag ggtgggtgee *eetggttgc tccacctgge cgtggcggat 
ttgctgtgct gtttgtctct gcccatcctg gcagtgccca ttgcccgtgg aggccactgg 
ccgtatggtg cagtgggctg tcgggcyctg ccctccatca tcctgctgac catgtatgcc 



tggtggtcta cggttcsgcg ggcgtgcggg gtgcaggtgg cctgtggggc agcctggaca 
ctggccttgc tgctcaccgt gccctccgcc etrteccgcc ggctgcacca ggagcacttc 
ceagcccggc tgcagtgtgt ggtggactac ggcggctcct ccagcaccga gaatgcggtg 
aetgccatce ygtttctttt tggcttcetg gggcccctgg tggccgtgoc cagctgccac 
agtgccctce tgtgctgggc agcccgacgc tgeeggccgc tgggcacagc cattgtggtg 
gggttttttg tctgctgggc ecccteccac ctgctggggc tggtgctcac tgtggcggce 
ccgaactccg cactcctggc cagggccctg cgggctgaae coctcategt gggcettgcc 
ctcgcteaca gctgcetcaa teccatgctc ttcetgtatt ttoogagggc teaactccgc 
cggtcactge cagctgcctg tcactgggcc ctgagggagt cccagggcc* ggacoaaagt 
gtggacagca agaaatccac cagccatgac ctggtctcgg agatgg*ggt gtag 
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180 
2«0 
300 
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<*00> 6 

Met Gly Asn Asp S*r Val S«r Tyr Glu 



S*r Asp Arg Pro val 

20 

Pro Lau Arg Val Ala 

35 

Gly Val Pro Gly Asn 
50 

Arg Arg Arg v«l Gly 
65 

Leu L*u Cys Cys L*u 
85 



Pro I 
Ala t 



c Gly 



25 

Pro Leu 
40 

Val Ala 
Trp Lau 
Pro II* 

Ala val 

105 



Tyr Gly Asp Tyr S*r Asp Leu 
10 15 

Gly Ala Cys L*u Ala II* Asp 

30 

Tyr Ala Ala II* Ph* L*u val 
45 

Trp Val Ala Gly Lys Val Ala 
60 

Leu His Leu Ala Val Ala Asp 
75 80 

Leu Ala val Pro Zl* Ala Arg 
90 95 



i Thr 
> Ala 
J ASB 



215 

J Cys 
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> Ala 
» S«r 



Gin val J 

Pro S*r I 
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Lau Gla C 
185 

Val Thr i 
200 

Val Ala 2 

Pro Tyr I 



r Arg Ala € 
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I Arg Glu £ 



i Mr Thr Sar His t 
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<2J0> 7 

<211> 1272 

<212> DMA 

<213> Hooo *«pi«ns 

<400> 7 

atgttgtgtc accgtggtgg ecsgctgata gtgccaatca tcecactttg cectgagcac €0 

tcctgcaggg gtegaageet eeagaacctt ctctcaggcc catggcccaa gcagcccetg 120 

g»»crtc»t» acctgagcte tccatctccc tctctctcct cctctgttct cccteeetcc 180 

ttctctccct caccctcctc tgetccctct gcctttacca ctgtgggggg gtcctetgga 2 40 

gggccctgcc accccacetc ttectcgetg gtgtctgcet tcctggcacc aatcctggcc 300 

ctggagtttg tcctgggcct ggtggggaee agtttggccc tcttcatctt ctgeatccae 360 

•cgeggccct ggacctccaa eacggtgttc ctggtcagcc tggtggccgc tgaettcctc 420 

ctgatcagca occtgcccct ccgcgtggac tactacctcc tccatgagac ctgjcgcttt 480 

ggggctgecg cctgcaaagt caaxctcttc atgctgtcca ccasccgcac ggccagcgtt 540 

gtcttcctea cagccatcgc actcaaccgc taectgeagg tggtgcagcc ccaccaegtg €00 

ctgagccgtg cttccgtggg ggeagctgec egggtggccg ggggactctg ggtgggcatc 660 

ctgctcctca acgggcacet gctcctgagc accttctccg gcccctcctg cctcagctec 720 

agggtgggca cgaagccctc ggcetcgctc egctggcacc aggcactgta cctgctggag 780 

ttcttcctgc cactggcgct catcetcttt gctattgtga gcattgggct caceatccgg 640 

aaccgtggtc tgggegggca ggcaggeccg cagagggcca tgcgtgtgct ggccatggtg" 900 

gtggccgtct acaccatctg cttcttgccc agcatcatct ttggcatggc tfcccatggtg 960 

gctttctggc tgtccgcctg ecgatccetg gacctctgca eacagctctt ccatggctcc 1020 

etggecttca cctacctcaa cagtgtectg gaccccgtgc tctactgctt etet«gecee 1080 

MCttcctce accagagccg ggccttgetg ggcctcacgc ggggccggca gggcccagtg 1140 

agcgacgaga gctcctacca accctccagg cagtggcgct accgggaggc ctctaggesg 1200 

gcggaggcca tagggaagct gaaagtgcag ggcgaggtet ctctggaaaa ggaaggctcc 1260 

tcccagggct ga 1272 



<210> 8 

<211> 423 

<212> PAT 

<213> Hcno sapiens 

<400> 8 

Mat Lan Cya Hia Arg Gly Gly Gin Lau J la Val Pro Ila I la Pro Leu 



Cys Pro Glu His Sax Cya Arg Gly Arg Axg Uo Gin Aon Lau Lan Ser 



WO 01/36471 



20 



2S 



30 



Gly Pro Trp Pro Ly» Gin Pro Hat Glu Lao Bis A*n Leu Scr Sax Pro 
35 40 45 

Sax Pro Sar Lau Sax Sax Sax Val Lag Pro Pro Sax Phe Sax Pro Sar 
SO 55 60 

Pro Sar Ser Ala Pro Sar Ala Pha Thr Thr Val Gly Cly Sar Sar Gly 

(5 70 75 BO 

Gly Pro Cya His Pro Thx Sax Sar Sax Lao Val Sar Ala Pha Lau Ala 
85 SO 95 



Thx Ala Sar Val Val Pha Lau Thr Ala Ila Ala lau Asn Arg Tyx Lau 



Ala Ala Arg Val Ala Gly Gly Lau Trp Val Gly Xla Lau Lau Lau i 



PCT/US00/31S09 



<210> 9 
<211> 966 
<212> 



<213> aom 


j aapiana 






<400> 9 
atgaaccaga 


ctttgaatag 


cagtgggacc gtggagtcag ccctaaacta ttecagaggg 


60 


agcscagtgc 


acecggccta 


cctggtgctg agctccctgg ccatgttcac ctgcctgtgc 


120 


gggatggcag 


gcaacagcat 


ggtgatctgg ctgctgggct ttcgaatgca caggaacccc 


180 


ttctgcatct 
tccaegctca 
atgaagagac 


atatccteaa 
gcctggaaae 
tgatgtactt 


cctggcggca gccgacctcc tcttcctctt cagcatggct 
ccagcccctg gtcaatacca ctgacaaggt ccacgagctg ' 
tgcctacaca gtgggcctga gcetgctgec ggc cat cage 


240 
300 
360 


acccagcgct 


gtctctctgt 


cctcttccct atctggttca agtgtceecg gcccaggcac 


420 


ctgtcagcct 
tetteettct 
gtccaggccg 


gggtgtgtgg 
gcagcaagtt 
ccctcateat 


cctgctgtgg ecactctgtc tcctgatgaa cgggttgacc 
cttgaaattc aatgaagatc ggtgcttcag ggtggacatg 
gggggtccta aceccagtga tgaetctgtc cagcctgacc 


400 
540 

600 


ctctttgtct 


gggtgcggag 


gagctcccag cagtggcggc ggcagcccac acggctgttc 


660 


gtggtggtcc 


tggcctctgt 


cctggtgttc ctcatctgtt ccctgcctct gagcatetac 


720 


tggtttgtgc 


tctactggtt 


gagcctgccg cccgagatgc aggtcetgtg cttcagcttg 


790 


tcacgcetct 


cctcgtccgt 


aagcagcagc gecaaccccg tcatctectt cctggtgggc 


840 


agecggagg* 


gccacaggct 
ccgagctgga 


gcccaccagg tccetgggga ctgtgetcca acaggcgett 
aggtggggag acgcccaccg tgggcaccae tgagatgggg 


900 
960 


gcttga 






966 


<210> 10 
<211> 321 
<212> PUT 
<213> HOOK 


> s apian* 






<400> 10 








1 


i Thx Lau A! 
5 


in sar sar Cly Thx val Glu Sex Ala Lau Asn 
10 15 





Tyx Sar Axg Gly Sar Thr val His Thr Ala Tyr Lau Val Lau Sar Sar 



WO 01/36471 



Lau Ala Hat Pha Thr Cys Lau Cys Gly Hat Ala Gly Asn Sar 
35 40 45 

Ila Trp Lau Lau Gly Pha Arg Mat His Arg Asn Pro Phe Cys 
50 55 60 

II* Lau Asn Lau Ala Ala Ala Asp Lau Lau Pha Lau Pha Sar 

CS 70 75 

Sar Thr Lau Ser Lau Glu Thr Gin Pro Leu Val Asn 



n Gly 
i Clu Asp Arg 



Mat Val 

Ila Tyr 

Mat Ala 
BO 

c Asp Lys 
95 

t Val Gly 

r Val Leu 

i Ala Trp 

Lau Thr 
160 

CyaPha 

Thr Pro 

I Trp Val Arg Arg Ser 

Val Leu 

) Pro Lau Sar lie Tyr 
I 240 

> Glu Hat Gin Val Lau 



i Lau Axg Glu 
300 

n Clu 



Leu Pro 
Clu Pro 



<210> 11 

<211> 1356 

<212> DMA 

<213> Reno sapiens 



: ccactttggg gagggtccct 



PCT/US00O15W 



tcggeatctg 
gccgctytg* 
cacetctgcc 
agctctgccc 
ctgagcgtgt 
tactattacg 
tctgtgetgg 
agggtctcct 
cacagtgcct 
ctgctcctca 



gtccctctec 
tggccctctt 
tggccgtgat 
tggtggacct 
tctttgacca 
gctttgtceg 

tgggtgtgtg 



tgceagtggg 
cttcaigctc 
cgcceagacg 
gctggctgce 
cgccctcttt 
cctggccatc 
catgcgctac 
ggtgaaggcc 



ctgctggact 
ectgccctcc 
ctgaccctc* 

ggggeggtgg 

ctctcggtgt 



actgccegct 
tecttgtggt 
cgctgcccac 
cgetggtcac 
aagcagcagt 
tctctttcca 



tggggctacg 
tgaccgctgt 
geeeatttgt 
tgcccctggc 
cctgccgcct 
cagccatcea 
tgacgctggg 
ettctgtgce 
gctgttcact 
tectttactt 



ggatgttgct 
ggctggcaet 
cttcgtcttc 
catgctetee 
ctaettgttt 
tgtggegcyc 
gctggtggcc 
agtgttggga 



eagctcgggg 
ggttctcctg 
CCtctatgtt 
ctggattggc 
tetggetgtc tuaccggc* gatccggggg 



gtctttgctg 
atgttccgag 



gccccccaga ccaccccaca 
gctgtggggg gacagttcct 
gccctgagtg ctcagcccet 



ttcctgeegt tccttcaggg 



agttcctgga 
ccgcctcacc 



gactggctgt 
•cctgctgtt 

ictc 

ceggctggag tcatga 



gagctcagca agcagtttgt 
cctagccggg agggctccat 
ccttctgagt cctgggtttc 
gactttcgaa t< 



atctctcagc 
ecggaegttt 
gctctgttyg 
ttcaactggg 

ccctttcttc 
ctgeteettc . 



iggcce gatagctgag 



gagecctctg 
ctccgtcctg 

<210> 12 

<211> 451 

<212> PUT 

<213> Rcao sapiens 

<400> 12 

Met Glu Ser Ser Pro II* Pro Gin Ser Ser Gly Asn Sec Ser Thr Leu 

IS 10 IS 

Gly Arg Val Pro Gin Thr Pro Gly Pro Ser Thr Ala Ser Gly Val Pro 
20 25 30 

Glu Val Gly Leu Arg Asp Val Ala Ser Glu Ser Val Ala Leu Phe Pha 
35 «0 45 

Met Leu Leu Leu Asp Leu Thr Ala Val Ala Gly Asn Ala Ala Val Met 



50 



55 



60 



Page 12 



240 

300 
360 
420 
460 
540 
600 
660 
120 
TOO 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1356 



FCT/USOO/31509 



Ala 

65 ■ 

His 
Ala 
val 



Ser I 
Pro < 



Val He Ala Lys 
Leu Cya Leu Val 



Thr Pro Ala Lou Arg Lys Phe Val Phe val Phe 



Asp Leu Leu Ala Ala Leu 

N 

Ser Ala Leu Phe Asp His 
105 

ser Val 
Val Glu 



i Pro Met Arg Tyr 
150 

f Val 



I val 

Pro Gly Cys Ser Leu Gin 
195 

, Val 



120 

i He Asn 
i Val Arg 
> val Lys 



I Val Val Tyr Cys I 



: Arg 
I Arg 

} Gly 



215 

c net 

) 

Ser 

Thr 



Tyr Phe 
Phe Arg 
Trp Met 

Thr Met 

265 



: Phe His Leu Tyr \ 



Ser ) 
Arg 1 



> Gin Gly Thr Gly 

i 

> Ser Pro Lys Gin 



I Val Thr Trp lie 

I 

: Gly Cys Leu Asn 
345 

. Cys Pbe Phe Lys 

360 

t Glu Gly Ser lie 
375 

i Pro Ser Glu 2 



Ala Leu 

170 

Glu Gly 
Ser Ala 
Leu Leu 



Val Thr 

Gly Gly 

Leu Cya 

Ala Gin 
315 

Gly Tyr 
Arg Gin 
Pro Ala 
Glu Glu 



Thr Leu 
Ala Leu 

Cys Phe 

125 

Arg Tyr 
140 

Leu Gly 



Pro Leu 

220 

Arg Val 
Pro Arg 



Trp Leu 

300 

Pro He 

Phe Cya 

He Arg 

Pro Glu 
365 

Asn Phe 

3S0 



110 

Val ser Leu 
Tyr Tyr Val 

Leu Val Ala 

160 

Ala Ser Val 

175 

Ser Val Pro 
190 

Gin Leu Pbe 
Leu Lev He 



Ala Val val 
Pro Tyr Phe 



Phe Thr Ser 



Glu Glu Leo 
Leu Gin Phe 



Glu Pro Pro Ala \ 



PCT/USOO/31509 



Leu Glu Ser 
450 

<210> 13 

<211> 1041 

<212> DMA 

<213> Bono « 



ggcaccttca gcaataacaa cagcaggaae tgcacaattg eaeacttcae gagagaattt 
ttcccaattg tatatctget aatatttttc tggggagtct tgggaaatgg gttgtcceta 
titgttttcc tgcegectta taagaagtcc acatctgtga acgttttcat geteaatctg 
gccattteag atctcctgtt cataegcacg ettcccttca gggetgacta ttatcttaga 
ggctecaatt ggatetttgg agacctggcc tgcaggatta tgtcttattc cttgtatgtc 
aacatgtaca gcegtattta tttcctgacc gtgctgagtg ttgtgcgttt cctggeaatg 
gttcacccct ttcggcttct gcatgtcaec agcatcagga gtgcctggat cctctgtggg 
atcatatgga tcettatcet ggcttcctca a teat get cc tggecagtgg ctctgagcag 
aacggeagtg tcscatcetg cttagagctg eatctctata eaattgetea gctgcagacc 
atgaactata ttgccttggt ggtgggctgc ctgetgeeet ttttcacact cagcatctgt 
tatctgetga tcattcyggt tctgttaaaa gtggaggtcc cegaatcggg getgegggtt 
tctcacagga aggcactgac caccatcatc atcacettga tcatettctt cttgtgtttc 



gaeagaetgc ataeagcttt ggttateeea ctggccttgg cagcagccaa tgeetgette 
aatcctctge tctattaett tgctggggag aattttaagg acagactaaa gtctgcactc 
I aagtgtgttt tccctgttag tgtgtggttg 



120 
1B0 
240 
300 
360 
420 
4*0 
540 
600 
660 
720 
7B0 



960 
1020 
1041 



<210> 14 

<211> 346 

<212> PRT. 

<213» Bono sapiens 



Met Glu Arg Lys Phe Net Ser Leu Gin Pro Ser He Ser val Ser Glu 

13 10 15 

Met Glu Pro Asn Gly Thr Phe Ser Asn Asn Asn Ser Arg Asn Cys Thr 



PCTAJS00/315O9 



He Glu Asn Phe Lys Arg Glu Phe Phe Pro He Val Tyr Leu He He 
35 40 45 

Phe Phe Trp Gly Val Leu Gly Asn Gly Leu Ser He Tyr Val Phe Leu 
50 55 60 

Gin Pro Tyr Lys Lys Ser Thr Ser Val Asn Val Phe Met Leu Asn Leu 
65 70 75 B0 

Ala He Ser Asp Leu Leu Phe He Ser Thr Leu Pro Phe Arg Ala Asp 
85 90 95 

Tyr Tyr Leu Arg Gly Ser Asn Trp He Phe Gly Asp Leu Ala Cys Arg 

100 105 110 



<210> 15 

<211> 1527 

<212> DMA 

<213> Hono * 



WO 01/36471 



ctctcci»»§ tgcccatcag 
ttcctcgccg cctctttcgt 
cagctgctge •ggtgacca* 
■tttcgcTcg tggccccetg 



c*gt»cgcge 
cctggcccac 
cggcaocst* 
ccgttttatc 



g*gogt»«ca 

ggcatcatcc 
gtgctggcgc 
tttaaectcc 
acctctgtge 



gcagccacac gtgcatgccc 
gctc«»cc9t gctggttate 
tagtgttgca gegoagceg 



:*ttgtcg tggtgtcagt 



ggatcgctac 



etetcttctg gccectcaac 
tcgecttcgc cagegtcaac 



ttgtceatca 
ctcetctatg 

ctgcagagca ctcctccoct ctacggctgg ggccaggctg 
ctctgctcca tgatctgggg 
atcgtcattc cactgattgt 
aggcagcatg ctetgctgta 
tgtgtggaga atgaggatg* 
gagtttcgcc gccagcatga 



catgattgcc 
caatgteaag 



tgtggccatc 
cctttgatga gcgcaatgct 
ttctcagcgt ggtgtccttc 
tggtgttctg tgcagcccgg 
tggaagtgcg agtcaaggac 
aogagttcca ggatgagagt 



» ggaaggaagc acggggacca gtgagagtag tgtagaggcc 



gtcaaccagt gcagcattga cttgggtga* gatgacatgg agtttggtga agacgacatc 
•atttcagtg aggatgacgt cgaggcagtg -aacatcccgg agagcctecc acceagtcgt 
cgtaacagca acagcaaccc tcctctgccc aggtgctacc agtgcaaagc tgctaaagtg 
atcttcatea tcattttctc ctatgtgcta tcectggggc ectactgct* tttagcagte 
etggccgtgt gggtggatgt cgaaaeccag gtaccccagt gggtgatcac cataatcatc 
tggcttttet tcctgcagtg ctgcatccac ccctatgtct atggctacat gcacaagaec 
attaagaagg aaatccagga catgctgaag aagttcttct gcaaggaaaa gcccccgaaa 



tcctacgatt ctgctacttt tecttga 

<210> 16 

<211> SOS 

<212> PUT 

<213> Homo saplan* 

<400> 16 

Mac Thr Ser Thr cy» Thr Asn Ser Thr Arg Glu S*r ton Sar Ser His 



120 
1*0 
240 



420 
480 
540 
600 
660 
720 
760 
840 
900 
960 
1020 
10B0 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1527 



WO 01/36-171 

Thr Cy» Hat Pro Lao Sar Lys Net Pro II* Sar Leu Ala Bis Gly 11a 
20 25 30 

II* Arg 5«r Thr Val Lao Val I la Phe Lav Ala Ala Sar Pha Val Cly 
35 40 45 

ton I la Val Leo Ala Lau Val L*u Gin Arg Lys Pro Gin Lau Lau Gin 
50 55 60 

Val Thr Asn Arg Pha Jla Pha ton Leo Lau Val thr top Its Uu Gin 
65 70 ?5 80 

Xla Sar Lau Val Ala Pre Trp Val Val Ala Thr Sar Val Pro Lau Pha 



PCT/US00V31509 



85 



90 



95 



PCT/US00/31509 



c Arg Arg ton Sar ton 
380 

i Ly» Ala Ala Lys Val 

I 400 

r Lau Gly Pro Tyr Cys 



i Thr Zla Lys Lys Glu 



'83 6 



<210> 17 

<211> 1068 

<212> DNA 

<213> Homo sapiei 



atatttgtct gggttatagc tttcattaec tgctttggaa 
agatctttca ttaaagctga aaatacaact caegctatgt 
gctgactgcc tgatgggtgt ttacttgtte tttgttggca 
gggcagtate agaagtatge ettgctgtgg atggagagcg 
ttcctggeca tgctgtccac cgaagtctet gttetgetae 
aagttcctgg tcattgtctt ccccttcagt aacattcgac 
gteatcctca tttgcatctg gatggcggga tttttaatag. 
asggattatt ttggaaactt ttatgggaaa aatggagtat 
caaacsgaag atattggaag caaagggtat tctcttggaa 
ctggcttttc tcatcettgt gttttcctat atteetatyt 
gecttgeaga ccacagaagt aaggaattgt tttggaagag 
ttctttttta tagtgttctc tgstgecate tgctggsttc 
ctttccctct tccgggtgga aataccagac aeaatgactt 
cttccagtta acagtgcttt gaatccaate ctctatectc 
' gscaagttga aacagctget gcacaaacat cagaggaaat 



tggctaaeaa tatcctcaga 60 

atctttttgt cattggcatg 120 

ccatcaaaat cctttgttgc 180 

tcttcgatat aaaataccga 240 

tgcagtgecg cctcatgggg 300 

tgacetactt gactttggag 360 

ctggaaaacg gcagacctc* 420 

ctgtaattcc attttggaat 480 

gtttcccact ttattatgac 54 0 

ttttcctagg tgtgaacttg 600 



•ggtggctgt tgcaaatcgt 720 

ctgtatttgt agttaaaatc 780 

cetggatagt gatttttttc 84 0 

tcacaaecaa cttttttaag 900 

:taaaaaa 960 



aaaagtttat etacatccat tgtgtggata gaggactcct 
ttgaacaasa taacacttgg agaeagtata etgaaaccag 

<210> 18 

<211> 355 

<212> PRT 

<213> Bono sapiens 

<400> 18 

Hat Pro Lau Tor top Gly 11a Sar Sar Pha Glu * 

IS 10 

ton II* Lau Arg Zla Pha Val Trp Val Ila Ala I 

20 25 

Gly Asa Lau Phe Val Ila Cly Mat Arg Sar Pha I 
35 40 

Thr Thr Bis Ala Mat Sar Ila Lys Ila Lau Cys 

50 55 

Mat Gly Val Tyr Lau Pha Pha Val Cly Ila Pha 

65 70 75 

Gly Gin Tyr Gin Lys Tyr Ala Lau Lau Trp Hat 
85 90 



cttccctgaa acttggggtt 1020 
tttcctag 1068 



15 

Thr Cys Pha 

Ala Glu Asn 

Asp Cys Lau 

Lys Tyr Arg 

val Gin Cys 
95 

Sar val Lau 

110 

i val Pha Pro 

t 

I Ila Lau II* 



Cys Al. 
Asp lie 
Glu Sar 
Glu Val 



. Ila Pro 
j cly val 
- Lys Gly 

i Ila 

205 

r Ala 

> 

t Ala val > 

i 

• Trp Ila 1 

t Ila Pro i 

2 

Thr Sar Trp Ila Val Ila Pha Pha Lau Pro Val ton Ser 

Paga 19 



Tyr Sar Lau 
190 

31* val Pha 

Lau Gin Thr 



Ala Lau Asn 



<J10> 19 

<211> 969 

<212> OKA 

<213> HORO 



:aatceacgg a*ctg»gg»g £0 

•ctctttjct ac»»gc»gac cttgagcctc »cggtgctg» cgtgcatcgt ttcccttgtc 120 

gggctgecag gaaacgcagt tgtgctctgg ctcctgggct gccgcatgcg caggaecgec 180 

ttctcc«tct acatcctcaa cttggccgca gcagecttcc tcttcctcag cggccgectt 240 

atatattccc tgtteagctt catcagtatc ccccatacca tctctaaaat cctcntcct 300 

gtgatgatgt tttcctactt tgcaggcctg agctttctge gtgccgtgag caccgegcgc 360 

tgcctgtccg tcctgtggcc catctggtac cgctgccacc gccccacac* cctgtcagcg 4 20 

gtggtgtgtg tcctgctctg ggccctgtcc ctgctgcgga geatcetgga gtggatgtta 480 

tgtggcttcc tgttcagtgg tgctgattct gcttggtgte aa*catcaga tttcatcaca 54 0 

gtcgcgtggc tgattttttt atgtgtggtt ctctgtgggt ccagcctggt cctgctgatc 600 

eggattctct gtggatcccg gaagataccg ctgaccaggc tgtacgtgac catcctgctc 660 

acegtactgg tcttcctcct ctgtggcctg ccctttggca ttcagttttt cctattttta 720 

tggatccacg tggacaggga agtcttattt tgtcatgttc atctagttcc tattttcctg 780 

tccgctctt* acagcagtgc caaccccatc atttacttct tcgtgggctc cttteggcag 840 

ctjtcoaaota ggcagaacct gaagctggtt ctccagaggg ctctgcagg* cgcgtctgag S00 



gtggatgaag gtggegggc* gcttcctgag geaatcetgg agctgtcggg aagcagattg 
gagcegtga 



960 
969 



<210> 20 

<211> 322 

<212> PRT 

<213> Boom aapiu 



WO 01/36471 

Mat Asp Pro The XI* Ser Thr Leu Asp Thr Clu Leu Thr Pro XI* Asa 
15 10 is 

Gly Thr Gla Clu Thr Leu Cys Tyr Lys Gin Thr Leu S»r Leu Thr v»l 

20 25 30 

Leu Thr Cys XI* Val S«r Leu Val Cly Leu Thr Gly Asn Ala v»l Val 
35 40 45 

Lao Trp Leo L*u Gly Cys Arg Mat Arg Arg Asn Ala Ph. S«r He Tyr 

50 55 60 

II* Leu Asn Leu Ala Ala Ala Asp Phe Leu pfaa Leu S*r Gly Arc Leu 
65 70 75 80 

II* Tyr S*r Leu Leu S«r Ph* Ila Sar II* Pro His Thr II* S*r Lys 



FCT/US00/31509 



90 



95 



Gly Gly Gin Leu Fro Glu Gin II* Leu Gin Leu 5er Gly Set Arg Leu 



PCT/USOO/31509 



<400> 21 

atggaggatc tctttagccc cteaattctg ccgccggcgc cceacatttc cgtgcccatc 

ttgctgggct ggggtctcaa cctgaccttg gggcaaggag cccctgcctc tgggccgccc 

agecgecgeg tccgcetggt gttcctgggg gtcatcctgg tggtggcggt ggcaggcaac 



aagatggact tcctgctggt geagctggec ctggcggace tgtacgcgtg egggggcacg 
gcgctgtcac agctggcctg ggaactgctg ggcgagcccc gcgcggccac gggggacctg 
gcgtgecgct tcctgcagct getgeaggea teegggeggg gcgcctcggc ccaccccgtg 
gtgetceteg ccctcgagcg ccggcgcgcg gtgegtctte cgcacggccg gccgctgccc 
gcgegtgccc tegccgccct gggctggctg ctggcactgc tgctggcgct gcccccggcc 
ttcgtggtgc geggggaetc cccctcgccg ctgccgccgc egccgccgcc aacgtccctg 
cagceaggcg cgcccccggc cgcccgcgcc tggccggggg agegtegctg ccacgggatc 
ttcgcgcccc tgccgcgctg gcacctgceg gtctacgcgt tctacgaggc egtegeggge 
ttcgtcgcgc ctgttecggt cctgggcgtc gcttgcggcc ecctactcte cgtctggtgg 
cggcaccggc cgcaggccec cgcggctgca gcgccctggt eggegagece aggtcgagee 
cctgcgccca gcgcgctgcc ccgcgccaag gtgeagagee tgaagatgag ectgetgetg 
gcgctgctgt tcgtgggctg cgagctgcec tactttgecg tccggctggc ggccgcgtgg 
tcgtccgggc ccgegggag* ctgggaggga gagggectgt eggeggeget gcgcgtggtg 
gcgatggcc* acagcgctct caatcccttc gtctacctct tcttccaggc gggcgactgc 
cggctccgge gaeagctgeg gaagcggctg ggctctctgt gctgcgcgec geagggaggc 
gcggaggacg aggaggggee ccggggccac caggcgctct accgccaacg ctggececac 
cctc*tt*tc accatgctcg gcgggaaccg ctggacgagg gcggcttgcg cccaecccct 
ccgcgcccca gacccctgce ttgetcctgc gaaagtgcet tctag 

<210> 22 
<211> 434 
<212> PRT 

<213> Homo saplans 
<400> 22 

M*t Glu Asp lent Ph* Sar Pro Sar XI* Leu Pro Pro Ala Pro Asn II* 

IS 10 15 

S«r Val Pro 11a Leu Leu Gly Trp Gly Leu Asn Leu Thr Leu Gly Gin 

20 25 30 

Gly Ala Pro Ala S«r Gly Pro Pro Sar Arg Arg Val Arg Leu Val Pb* 



120 
180 
240 
300 
360 
420 



960 
1020 
1080 
1140 
1200 
1260 
1305 



PCT/USOO/31509 



L*u Gly val II* Leu Val Val Ala Val Ala Gly Asn Thr Thr Val Leu 

50 55 60 

Cys Arg Leu cys Gly Gly Gly Cly Pro Trp Ala Gly pro Lys Arg Arg 

65 70 7$ 80 

Lys Met Asp Ph* Leu Leu Val Gin Leu Ala Leu Ala Asp Leu Tyr Ala 
85 90 95 



<210> 23 




<211> 1041 


<212> DMA 




<213> BOM 


> sapiens 


<400> 23 




atgtacaacg 


ggtcgtgetg 


ctgctcattg 


tggccittgt 


tgcttccaea 


cgaagacctg 


gatttcctcc 


ttetgatctc 


tgggettttg 


gggacattce 


gggagcatcg 


tgttccttac 


caccacgcgg 


tgaacactat 


gccctggtca 


tcetgggaac 


acggccgtct 


cctgtgagag 


ttccagctgg 


•gttctttet 


tggagcctga 


ggcggaggca 


ttcatcatgg 


tggtgccaat 


ctctatttec 


tctggacggt 


caeataacce 


tcagcttcac 


tcaagcccct 


cctttcccaa 






sggegttgca 


tcagtgtggc 


cacattgttg 


agtggcaetg 



ccgcatcgag 


ggggacacca 


tctcceaggt 


gatgccgccg 


60 


gctgggcgca 


ctaggcaatg 


gggtcgccct 


gtgtggttte 


120 


gsageccagc 


artgtttacc 


ttttc**ttt 


ggccgtggct 


190 


cctgcctttt 


cggacagact 


ettacctcag 


acgtagacac 


240 


ctgcegegtg 


gggetcttca 


cgttggccat 


gaacagggce 


300 


ggtggtggct 


gcggacaggt 


etttcaaagt 


ggtccacccc 


360 


ctccacccgg 


otggcggctg 


gcatcgtetg 


caccctgtgg 


-20 


agtgtatctt 


ttgctggaga 


occatctctg 


cgtgcaagag 


480 


cttcatcatg 


gagtcggcca 


atggctggca 


tgacatcatg 


540 




atcatcttat 


tttgctectt- 


caagattgtt 


600 


gcagctggcfc 


agacaggctc 


ggatgaagaa 


ggcgacccgg 


660 


tgtgttcatc 


acatgctacc 


tgcccagcgt 


gtctgctega 


120 


gccctcgagt 


gcctgcgatc 


cctctgtcca 


tggggccctg 


780 


ctacatgaec 


agcatgeegg 


atcccctggt 


gtettatttt 


840 


attctecaac 


aagctcaaaa 


tctgcagtct 


gaaacccaag 


900 


acaaaggccg 


gaegagatgc 


caatttcgaa 




960 


aaatagtttc 


caaagccagt 


ctgatgggca 


■tgggatccc 


1020 










1041 



<210> 24 

<211> 34 6 

<212> PRT 

<213> Bono sapiens 

<40C> 24 

Met Tyr Asn Gly Ser Cya Cys Arg Ila Clu Gly Asp Thr XI* Ser Gin 

13 10 15 

Val Mat Pro Pro Leu Lau Ila Val Ala Phe Val Leu Gly Ala Lea Gly 



PCT/USOW31509 



Mat Ila Cya 
63 

Trp Ala Ph* 
Met Asn Arg 



Thr Ala Val 
His Asp Ila 



Ala Lao 

val Tyr 

Lau Pro 

Gly Asp 
83 

Ala Cly 
100 

> Lya Val 

i 

l Ala Ala 
r Val 1 
Ser ( 



33 

Ph* Arg 

70 

lie Pro 
Ser Ila 
Val His 



150 
l Glu Ser 



Ph* Cya I 

40 

A*n Lao I 
Thr Asp 1 

Cya Arg > 

? 

Val Phe I 

105 



» Hi a Mat 

t val Ala 
60 

'V? 

I Gly Lata 

> Thr Val 

> Ala Val 



Lys Thr Trp Lya 



Leu Arg Arg Arg Bis 



Pha II* Mat 



155 
Clu Sar J 



> Ala Lau val ] 

> 

* Cys Val Gin ( 



i Pha Mat Pro 1 



t Trp 
i 

t II* 



Sar Lya Thr C 
305 

Arg Sar Cys 3 

Gin Trp Asp I 

<210> 25 

<211> 1011 

<212> DMA 

<213> Hoiao 



I Ser Ph* Lys Ila 

i 

I Gin Ala Arg Met 
215 

• val Pha Ila Thr 

230 

» Leo Trp Thr Val 
245 

i II* Thr 

val Tyr Tyr 

Lys He Cys 
295 

Arg Pro Glu 



Val Trp S 
200 

Lys Lytf f 
Cya Tyr 1 
Pro Sar ! 



r Ala Cya J 
i Thr Tyr K 



r Val Sar Ala Arg 
240 

S*r Val 



r Val Ala 

i 

> Ila Val 



Glu Mat 
Asn Sar 



Lys I 
Pro 1 



t Gin Pro Gly 

) 

c Asn Lou Gly 



Arg 

320 

Gly 



atttacatcc 



tgggeetatc 
gcaagtatgt 



gaaggagaag 

attgcaggtc 
tactectctt 
gatttgactt 
tgettccttc 
tgtcagcaat 
agaagtagca 
tataatctct 



atacaacatg 
tcctttgtat 
taggtaaaaa 
tgtgcagtgc 
aatetgctca 
ttgtcegtct 
aggattcetc 
aaaaatttcg 
tgggcataat 
agagcctatg 
tcattggaac 
ttgtaagcca 



tattcaacca 
tgttggtgtt 
•acatcaacg 
catgcctttc 
atgcagagtg 
cttaatttta 
gcaagagaet 

cattccagtt 
ctacaatcgg 
cecatttatt 
tctgagaaaa 



ttttaaaccc 



cttatagtat 
tgaattattt 
cagaccccat tatatttctt 



tctatgatct 
tttggaaac* 
cacatctacc 
atgagtatct 
gtcaattttc 
agttggattg 
acttcatgct 
tttgctaga* 
accgtatact 
cagatggaac 
ggattttcct 
ataagaacet 
cttttggtca 
attttttatg 
aaaaacattc 
ttattagat* 
catatgcaat 



cttccatggc tttaccaatc 
ctetctctca atggatattt 
tgtcacacct tgtgactgca 
atttcctgaa sggtttccaa 
tgggaaccct atccatgcat 
ccataagceg ctatgetacc 
atgagaaaat attttatggc 
aactatgcat ttacatatgg 



taggagccat gatctctcag 
ttttagtagt actaecatca 
gtecgtccat tatggagaaa 
tccag*ttct actaatagtt 
ttctacacc* aagagataac 
tcacctgtct tgcttcggce 
aaacattcaa gaagacacta 
catatggttg a 



<210> 26 

<211> 336 

<212> PRT 

<213> nemo .api.no 

<400> 2« 

Mat Asa. Asn Asn Thr Thr Cys Ila Gin Pro Sar Met II* Ser i 
15 10 

Ala Lau Pro II* II* Tyr II* Lau Leu Cya II* Val Gly Val 

20 23 30 

Asn Thr Lau Sar Gin Trp lie Pha Lau Thr Lys He Gly Lys 
35 40 . 45 

Sar Thr Mis Ila Tyr Lau Ser His Lau Val Thr Ala Asa Lau 
50 S3 60 

Cys Ser Ala Mat Pro Ph* Mat Sar II* Tyr Pba Leu Lys Gly 

6S 70 75 

Trp Glu Tyr Gin Sar Ala Gin Cys Arg Val Val Asn Ph* Lao 



15 

Pha Gly 

Lys Thr 

Lau Val 

Ph* Gin 
•0 

Gly Thr 
95 

S*r Trp 



120 
1B0 
240 
300 



600 
660 
720 



960 
1011 



WO 01/JM71 



<210> 27 

<211> 1014 

<212>' DMA 

<213> Hono sapiens 



atgaatgage cactagaeta 


tttagcaeat gcttctgatt 


tccccgatta 


tgeagctget 


60 


tttggasatt gcactgatga 


aaacatece* etcaagatgc 


actacctccc 


tgttetttat 


120 


ggcattatet tectcgtggg 


atttccaggc aatgeagtag 


tgatatccac 


ttacattttc 


180 


aaaatgagac cttggaagag 


cagcaccatc attatgetga 


acctggcctg 


cacagatctg 


240 


ctgtatctga ccagcctecc 


cttcctgatt cactactatg 


ccagtggcga 


•aactggatc 


300 


tttggagatt tcatgtgtaa 


gtttatcege ttcagcttcc 


atttceacct 


gtatagcagc 


360 


atcctcttcc teaectgttt 


eegcatcttc cgctactgtg 


tgatcattca 


cccaatgagc 


420 


tgcttttcca ttcacaaaae 


tcgatgtgca gttgtagcct 


gtgctgtggt 


gtggatcatt 


4 00 


tcactggtag ctgtcattec 


gatgaccttc ttgatcacat 


caaccaacag 


gaccaacaga 


340 



PCT/US00/315O9 



tcagcctgtc tcg»cctcac cegttcogat geactcaata ct»tt**gtg gtecaacctg 600 

ettttgactg caactacttt etgcctcccc ttggtgatsg tgecactttg ctataccacg SCO 

attatccaca ctctgaccca tggactgcaa ectgaeagct gccttaagca gaaagcacga 720 

aggctaacee ttctgctact ccttgcattt tacgtatgtt ttttaccctt ccatatcttg 700 

egggtcettc ggaccgaatc tcgcctgctt tcaatcagtt gttccattga gaatcagatc 040 

catgaagctt eeatcgtttc tagaccatta gctgctctge acacctttgg tucctgcu §00 

ctatatgtgg tggtc*g«ge caactttcag caggctgtct gctcaacsgt gagatgcaaa 960 

gtaagcggga accttgagca agcaaagsaa attagttact caaacaaccc ttga 1014 

<210> 20 

«211> 337 

<212> PUT 

<213> Horao sapiens 

<400> 20 

M«t Aan Glu Fro L«u Asp Tyr Leu Ala Asa Ala Sar Asp Phe Pro Asp 

1 5 10 15 

Tyr Ala Ala Ala Fha Gly Asn Cy* Thr Asp Glu Asn I la Pro L*u Lys 

20 25 30 

Hat His Tyr In Pro Val 11a Tyr Gly Ila 11a Hw lam val Gly Phe 
35 40 45 : 

Pro Gly Asn Ala Val Val Ila Sar Thr Tyr He Phe Lys Mac Arg Pro 
50 55 60 

Trp Lys S«r Sar Thr Ila Ila Met Lan Asn Leu Ala Cys Thr Asp Leu 

65 70 75 00 

Leu Tyr Leu Thr 8er Leu Pro Phe Leo He His Tyr Tyr Ala Sex Gly 



Leu Pro Leu Val He Val Thr Leu Cys Tyr Thr Thr He He His Thr 



WO 01/36-17) 



PCT/US00/31509 



Val 9«x Asp Asn Phe Gin Gin Ala Val Cya Ser Thr Val Arg Cys Lys 
305 310 315 * r 320 



<210> 29 

<211> 993 

<212> DMA 

<213> Hobo 



aggaacgcct 
ttccagatta 
ttccctegct 



agcatcttgg 
cagacatttg 



tgctttgtgg 
ggctggtagg aaacgggttt 
tctctgtct* cgtcctcagc 
teeattgcct ggtgtacctc 
tgtgatgacc 
ctgcctgtcc 
ggtcgtgtgt 
ctgtggcttc 
attteatcac tgcagcgtgg 



gcaccgagcg 
acctgtcagc 



ctgatcccgg 
gtgctctggc 
etggccgggg 
agtaacttct 
tgtgcctace 
gtcetgtggc 
gtcctgctct 
ttatttagtg 
ctgatttttt 



cagtgaatgg 
tcttcctgat 
tcctgggctt 
ccgacttect 
tctgttccat 
ttgcaggect 
ccatctggte 
gggccctgtc 
atggtgactc 
tattcatggt 



ccttttcatr 
ccgcatgcge 
ettcetctgc 
ctceatcaet 
gagcatgctg 
tcgctgccgc 
cetactgctg 



tctetgtggg 
ectgaccagg 



»«H S t W t 

catccagttt 
ttcgtgggct 



tcctaatatt 
cagttgtcct 
cttttaggaa 
tgcaggacat 
tgtcgagaag 



atggatctgg 
gtcatctctt 



tgctgaggeg 
cagtetggtg 



gtgttcctcc 
aaggattctg 
eacsgcagtg 
ctgcagcagc 
gstcacagtg 



tctgcggcct 
atgtcttatt 



egetcctcaa 
aaggatgctt 



ttgtcatatt 
catttacttc 
gctggctctc 



120 
1B0 
2*0 
300 
360 
420 
4B0 



660 
720. 
760 
040 



PCTAJS00A31509 



<210> 30 

<211> 330 

<212> PRT 

<213> Hobo 



<400> 30 

Met Asp Pro Thr Thr 
1 5 

Gly Asn Asp Gin Ala 

20 

Pro Val Phe Leu He 

35 

Gly Phe Val Leu Trp 
50 

Ser Val Tyr Val Leu 

65 

Phe Gin He Ha Asn 
B5 

He Ser He 



100 



Ser He Leu Glu Gly I 
165 

Ser Gly Trp Cys Gin 1 



Pro Ala Trp Gly Thr Glu Ser Thr Thr Val Asn 
10 15 

Leu Leo Leu Leu Cys Gly Lys Glu Thr Leo He 

25 30 

Leu Phe He Ala Leu Val Gly Leu Val Gly Asn 
40 45 

Leu Leu Gly Phe Arg Met Arg Arg Asn Ala Phe 

55 60 

Ser Leu Ala Gly Ala Asp Phe Leu Pbe Leu Cys 

70 75 80 

Cys Leu Val Tyr Leu Ser Asn Phe Fha Cys ser 

50 95 

Val Net Thr Cys Ale 
110 

Ser Thr Glu Arg Cys 
125 

Arg Arg Pro Arg Bis 
140 

Cys val Leu Leu Trp Ala Leu Ser Leu Leu Leu 

150 155 160 

i Phe Ser Asp Gly Asp 

175 

c Ala Ala Trp Leu He 



a Leu Val Arg 
5 

u Tyr Leu Thr 

220 

• Gly Leu Pro Phe Gly 

i Asp Ser Asp Val Leo 

255 

i Ser Ser Leu Asn Ser 

270 

tie Tyr Phe Phe val Gly Ser Phe Arg Lys Gin 



PCT/USOO/31509 



<210> 31 

<211> 1092 

<212> DMA 

<213> Mono t 



<i00> 31 
atgggccccg 


gcgaggcgct 


gctggcgggt 


ctcctggtga 


tggtactggc 


cgtggegctg 


60 


ctatccaacg 


cactggtgct 


gctttgttgc 


gcctacagcg 


ctgagctecg 


cactegagcc 


120 


tcaggcgtcc 
cccttcacgc 


tcctggtgaa 
tgctcggtgt 


tctgtcgctg 
gatgcgcggg 


ggccacctgc 
cggacaccgt 


tgctggcggc 


gctggacatg 
cgcatgccaa 


180 
240 


gtcattggct 


tcctggacac 


cttcctggcg 


tccaacgcgg 


cgctgegcgt 


ggcggcgctg 


300 


agcgcagacc 


agtggctggc 


agtgggcttc 


ccactgcgct 




ectgcgaccg 


360 


cgetatgccg 


gcctgctgct 


gggctgtgce 


tggggacagt 


cgctggcctt 


ctceggcgct 


4 20 


gcacttggct 


gctcgtggct 


tggctacagc 


egcgccttcg 


cgtcctgttc 


gctgcgcetg 


480 


ccgcccgagc 


ctgagcgtcc 


gcgcttcgca 


gccttcaccg 


ccacgctcca 


tgcegtgggc 


540 


ttcgtgctgc 


cgctggcggt 


gctctgcctc 


acctcgctcc 


aggtgcaccg 


ggtggcacgc 


600 


agccactgcc 
c»cccc*gtg 


agcgcatgga 


caccgtcacc 
ctgcctcatc 


atgaaggcgc 
cagcagaagc 


tcgcgctgct 
ggcgccgcca 


cgccgacctg 
ccgcgccacc 


660 
720 


aggaegattg 


gcattgctat 


tgcgacctte 


ctcatctget 


ttgccccgta 


tgtcatgacc 


780 


aggctggcgg 


•gctcgtgcc 


cttcgtcacc 
ggtggccgac 


gtgaacgccc 
ccgttcacgt 


agtggggcat 


cctcagcaag 


840 
900 


tgcctgacct 


acagcaaggc 

tcetggccgg 


catggtgcac 


cggctgctga 


actctctgct 
agagaacccc 


gcgcccegca 


960 


tecacccatg 


acagetctet 


ggatgtggec 


ggeetggtgc 


accsgctgct 


gaagagaacc 


1020 


ccgcgcccag 


cgtccaccca 


eaaeggctct 


gtggacacag 


agaatgatte 


ctgcctgcag 


1060 


eagacocact 


9* 










1092 



<210>- 32 

<211> 363 

<212> PPT 

<213> homo sapiens 

<400> 32 

Met Gly Pro Gly Glu Ala Leu Leu Ala Gly Leu 

15 10 

Ala val Ala Leu Leu Ser Asn Ala Leu Val Leu 



Leu Val Met Val Leu 

15 

Leu Cys Cys Ala Tyr 



WO 01/3*47] 

Sar Ui Glu Ln Mf Thr Ail W» *« Cly Val Leu L*u Val Aan Leu 
35 40 45 

Sex In Cly Hl» Ln L*u Uo Al* Al* L*u Asp M*t Pro Rm Thr Leu 
50 55 60 

Leu Cly Val Hat Arg Gly Axg Thr fro Ser Ala Fro Cly Ala Cys Gin 
65 70 15 00 

Val 11a Gly Ml* Lao Asp Thr Phe Lau Ala Ser Asa Ala Ala Lan S«r 



65 



95 



Cln Trp Cly Ila Lau 8ar Lys cys Leu Thr Tyr Sar Ly» Ala Val 



<400> 34 

Mat Pro Thr Lau Asn Thr Ser Ala Sax Pro Pro Thr Pha Ph« Trp Ala 
15 10 15 

Asn Ala Sar Cly Cly Sar Val Lau Sar Ala Asp Asp Ala Pro Met Pro 

20 25 30 

val Lya Pha Lau Ala Lau Arg Lau Met Val Ala Lau Ala Tyr Gly Lau 
35 40 45 

val Cly Ala He Cly Lau Leu Gly Asn Lau Ala Val Lau Trp Val Lau 

50 55 60 

Sex Asn Cys Ala Arg Arg Ala Pro Gly Pro Pro Sar Asp Thr Pha Val 
65 70 75 00 

Page 33 
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<213> Jkaac 


> sapiens 








<400> 33 
atgcccacac 




tgcctctcce 


cccacattct tctgggecaa tgcctccgga 




ggeagtgtge 


tgsgtgctgs 


tgatgetec* 


atgcetgtea aattoctage cctgaggctc 




• t g g t 1 9 CCC 


tggcctatgg 


gcttgtggeg 


gecattgget tgctggg*** tttggcggtg 


180 


etgtgggtac 


tgagtaactg 


tgeceggaga 


gcccctggcc caccttcags caccttcgtc 


2*0 


ttcsacctpg 


ctctggcgga 


cctgggactg 


gcactcactc teccettttg ggcagecgag 


300 


teggcaetgg 


actttcactg 


gecett egg* 


ggtgccctct gcsagatggt tctgacggcc 


360 


actgtcctca 


•cgtcutge 


cagcatcttc 


eteatcecag cgctgagcgt tgctcgctac 


420 


tgggtggtgg 


ecatggctgc 


OOgoccaggc 


scccacctct eactcttctg ggeccgeata 


480 


gccaccctgg 


cagtgtggge 


ggcggctgcc 


ctggtgacgg tgcccacege tgtcttcggg 




gtggagggtg 


aggtgtgtgg 


tgtgcgcctt 


tgcctgctgc gtttccccag caggtactgg 


600 


ctgggggcct 


accagctgca 


gagggtggtg 






accaccagct 


sect get get 


gctggccttc 


ctgcagcggc ggcaacggcg geggcaggae 


. 720 


agcagggtcg 


tggcccgctc 


tgtccgcatc 


etggtggett ccttcttcct ctgctggttt 


700 


cccaaccatg 


tggtcactct 


ctggggtgte 


ctggtgaagt ttgacctggt gccctggaac 


840 


egtactttct 


atactatcca 


gacgtatgtc 


ttccctgtca etacttgett ggcecacagc 


soo 


aatagctgee 


tcaacectgt 


gctgtactgt 


ctcctgaggc gggagceccg gcaggctctg 


960 


gcaggcacct 
caggtggccc 


tcagggatct 
taaagcsggt 


geggtegagg 
aggcaggegg 


ctgtggccce agggeggegg ctgggtgca* 
tgggtcgcaa gcaacccccg ggagegcege 


102 0 
1OB0 


cettetaece 


tgctcaccas 


cctggacaga 


gggecacccg ggtga 


1125 


<210> 34 
<211> 374 
<212> PRT 
<213> Host 


a sapiens 









WO 01/36471 



PCTYUS00/315O9 



WO 01/36471 



Pha Asn Leu Ala Leu 
B5 

Trp Ala Ala Clu Sar 

100 

Leu Cys Ly* Mat Val 

115 



Ala Thr Leu Ala Val 

165 

Ala val Pha Gly Val 



Ala Asp 
Ala Leu 
Leu Thr 



Gly 1 

! 

Asp Ph* Bis 1 

105 

Val 1 
Ala J 



105 
l Thr \ 



i Leu Thr Leu I 
! 

> Pro Ph* Cly C 



i Leu Ser Val 1 

135 



Trp Ala 

Clu Cly 
Arg Tyr 



i Leu Sex Leu 



Ala Ala Ala Leu 



225 

Sex Arg Val Val Ala 



230 

Arg Ser 1 
a Asn Mis \ 
I pro Trp I 



170 

j Gly 
> Cly 
I *rg 



Phe Trp Ala 

Val Thr Val 

Val Arg Leu 
190 



Tyr Val Pha Pro Val 1 
290 

Asn Pro Val Leu Tyr < 



I Asp Leu 

i 

i Val Ala 

j Clu Ser 
> Cly 



200 

r Cys 1 
I 

i Lau I 

Axg t 

Leu 1 

Arg I 



: Thr 
) Ala I 

> Arg C 

r Arg I 
330 

l Gin » 
i 

> Ser 1 



Cln Arg Arg Arg 
235 

Leu Val Ala Ser 

Lau Trp Gly val 
270 

Phe Tyr Thr lis 

203 

His sex Asn ser 

300 

Clu Pro Arg Cln 
315 

Leo Trp Pro Cln 

l Cly Axg Axg 
350 



val Ala 

Axg He 
160 

Pro Thr 
175 

Cya Leu 

Cln Arg 

Sex Tyr 

Gin Asp 
240 



Cln Thr 
Cys Leu 



Trp val 
Asn Leu 



<210> 35 

<211> 1092 

<212> DMA 

<213> Hono sapiens 



ggatcacttt ctggaaatag acaagaagaa ctgctgtgtg 
Page 34 



ttccgagatg 


octteattgt 


caaggtgttg ccgecggtgt tggggctgga 


gtttatcttc 


120 


gggcttctgg 


geaatggect 


tgccctgtgg attttctgtt tccacctcaa 


gtcctggaaa 


IBO 


tccagccgga 


ttttcetgtt 


caacctggca geggctgact ttctactgat 


catctgcctg 


240 


cccttcctga 


tggacaacta 


tgtgaggcgt tgggactgg* agtttgggg* 


catccettgc 


300 


eggctgatge 


tcttcatgtt 


ggctatgaac cgccagggca gcatcstett 


cc tea eggtg 


360 


gtggcggtag 


acaggtattt 


ccgggtggtc eatccccacc acgccctgaa 


ca a gate tec 


420 


aatcggaeag 


cagccatcat 


ctcttgcctt ctgtggggca tcactattgg 


cctgacagtc 


«B0 


cacctcctga 


sgeegeegat 


gccgatccag aatggcggtg caaatttgtg cagcagcttc 


540 


agcatctgcc 


ataccttcca 


gtggcacgjia gccatgttcc tcctggagtt 


cttcctgccc 


600 


ctgggcatca 


tectgttctg 


c*cagccaga attatctgga gcctgcggca 


gagacaaatg 


660 


gaeeggcatg 


eeaagatcaa 


gagagecate accttcatca tggtggtggc 


categtcttt 


720 


g-tcatctget 


tccttcccag 


cgtggttgtg eggetcege* tcttctggct 


cctgcacact 


760 


tegggcaege 


agaattgtga 


agtgtaccgc tcggtggacc tggegttctt 


tatcactctc 


840 


agcttcacet 


aeatgaacag 


catgetggae cccgtggtgt actacttctc 


cegcccatcc 


900 


ttteccaact 


tcttctccac 


tttgatcaac cgctgcctcc agaggaagat 


gacaggtgag 


960 


ccagataata 


eeegcagcec 






1020 


gctccsgagg 


cgttaatggc 


caactccggt gagccatgga gcccctctta 


tetgggecca 


1000 


•cctctcctt 








1092 



<210> 36 

<211> 363 

<212> PRT 

«213> Hono sapiens 

<400> 36 

Met Asn Arg 81s His Leu Gin Asp Bis Phe Leo Clu 11a Asp Lya Lya 
15 10 15 

Asr. Cys Cys Val Ph* Arg Asp Asp Phe II* Val Lys Val Leu Pro Pxo 

20 25 30 

val Lent Cly Leu Clu Pho 11* Ph* Gly Lau Leu Cly Asn Gly Leu Ala 
35 40 45 

Leu Trp lie Ph* Cya Pb* His Lou Lys S*x Txp Lys Sax Ser Arg II* 

SO 55 60 

Phe Leu Ph* Asa L*u Ala Val Ala Asp Pha Leu Leu II* He Cys Leo 
65 70 75 00 

Pro Ph* Lou Mat Asp Asn Tyr v*l Arg Arg Trp Asp Trp Ly» Ph* Cly 



85 



90 



PCT/US00V3 1 509 



• Gin Trp Bis Glu Ale Met 



Lya Thr Arg Gly Ala Pro Clu U> Leu Met Me Asn Ser Gly Gin Pro 



<210> 37 

<211> 10«4 

<212> DNA 

<213> Homo sepiei 



atgggggatg agctggcacc ttgccctgtg ggcactacag 


cttggceggc 


cctgetcceg 


60 


ctcetcegce egacaccctg catgececaa gcagccagca 


ecacttcctt 


gggcctgggg 


120 


gecctceggg tgcccegctc catgctgtac tggcttttcc 


ttccctcaeg 


cctgctggct 


180 


gcegccecec tggctgtcag ccccctgctg ctggtgacca 


tcctgcggaa 


ccaecggctg 


240 


cgaceggagc cccactaect gctcccggct aecatcctgc 


tctcagacct 


ggcctecatt 


300 


ctcctccaca tgctcatetc ctccagcage ctgggtggct 


gggagctggg 


ccgcatggcc 


360 
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tgtggcettc 


tcactgatgc 


tgtcttcgce 


gcctgcacc* 
atccatccac 


geaccatcct gtccttcacc 


420 

480 


9ccattgtgc 
tcccetgggg 


tgcecaccts 
etgcctggaa 


ecrggcagtc 
ggcagr.ggc« 


ctcatctggc 


tgcgctacct ctccttcatg 
tggtggcctg ctgcttcccc 


540 


acattectta 


tttgfctceg 


caagtggcag 


gat gcc cage 


tggaggagca aggagcttca 


600 


tacatcctac 


caccaagcat 


gggcacceag 


ccgggatgtg 


gcctcctggt cattgttacc 


660 


tacecctcca 


ttctgtgcgt 


tctgttcctc 


tgcacagctc 


teartgeca* ctgtttctgg 


720 


aggatctetg 


cagaggccaa 


gacttceggc 


•tctgggggc 


agggctattc ccgggccagg 


7B0 


ggcaccctgc 


tgatccactc 


agtgctgate 


ecattgtacg 


tga9caca99 ggtggtgttc 


840 


tccctggaca 


tggtgetgae 


caggtaccac 


cecattgact 


ctgggactca cacatggctc 


900 


ctggcagcta 


•cagtgaggt 


actcatgatg 


cttccccgrg 


ccatgctccc ataectgtee 


960 


ctgctecgct 


accggcegc t 


gttgggcatg 


gtccggggcc 


acctcceatc caggaggcac 


1020 


caggccatct 


ttaccatttc 


ctag 






1044 



<210> 3B 

<211> 347 

<212> PRT 

<213> Mono sapiens 



Ala Lm He Gin Leu lie Ser Lya Thr Pro Cys Mat Pro 

20 25 

Ser Asn Thr Ser Leu Gly Leu Gly Asp Leu Arg Val Pro 

35 40 45 

Leu Tyr Trp Lea Phe Leu Pro Ser Ser Leu Leu Ala Ala 

50 55 6D 

Ala Val Ser Pro Leu Leu Leu Val Thr lie Leu Arg Asn 
65 70 75 

Arg Gin Clu Pro Bis Tyr Leu Leu Pro Ala Asa lie Lau 



85 



90 



Ala Trp Pro 

15 

Gin Ala Ala 
30 

Ser Ser Met 

Ala Thr Leu 

Gin Arg Leu 
80 

Leu Ser Asp 
95 



125 

' 140 

I Tyr Leu 

i lie Trp 
. 37 



Asp Ala val 
lie Val Leu 



Ser Arg Ala Arg Gly Thr leu Leu He His Ser val Leu He Thr Leu 



<210> 39 

<211> 1023 

<212> DNA 

<213> Homo sapiens 
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aataaagagt ttgettetea aaetgccagt gtggtagata cagtcatcct cccttccatg 120 

attgggatta tctgttcaac egggctggtt ggcaacatcc tcattgtatt caetataata 180 

agatecegga aaaaaacegt ccctgacatc tatatctgea acetggctgt ggctgatttg 240 

gtccacatag ttggaatgcc ttttcttatt- caccaatggg cccgaggggg agagtgggtg 300 

tttggggggc ctctctgcac catcatcaca tccctggata cttgtaacca atttgectgt 360 

agtgccatca tgactgtaat gagtgtggac aggtactttg ccctcgteea accatttcga 420 

ctgacacgtt ggagaacaeg gtacaagace ateeggatea atttgggect ttgggcegct 480 

tcctttatcc tggcattgee tgtctgggtc tactcgaagg tcatcaaett taaagacggt 54 0 

gttgagagtt gtgcttttga tttgacatcc cctgacgatg tactetggta tacactttat 600 

ttgacgataa caactttttt tttccctcta cccttgattt tggtgtgcta tattttaalt 660 

ttatgefcate cttgggagat gtatcaacag aataaggatg ecagatgetg caatcccagt 720 

gtaccaaaac agagegtgat gaagttgaca aagatggtgc tggtgctggt ggtagtcttt 780 
Page 38 



atcctgagtg ctgcccctta tcatgtgete caactggtga ecttacagat ggaacagccc 

acactggcct tctatgtggg ttattacctc tccatctgtc teagctatge cagcagcagc 

attaaccctt ttctctacat ectgetgagt ggaaatttce agaaaegtet gcctceaatc 

caaagaagag cgactgegaa ggaaatcaac aatatgggaa acaetctgaa atcacacttt 1020 
tag 1023 

<210> 40 

<211> 340 

<212> PRT 

<213> How sapiens 

<400> 40 

Met Asn pro Phe His Ala Sec Cys Trp Asn Thr Ser Ala Glu Leu Leu 



PCT/US00/3I509 

840 

900 
960 



10 



15 



Asn Lya Ser Trp Asn Lys Glu Phe Ala Tyr Gin Thr Ala Ser Val Val 

20 25 30 

Asp Thr val He Leu Pro Ser Met He Gly He He Cys Ser Thr Gly 

35 40 45 

Leu Val Cly Asn He Leu He Val Phe Thr He He Arg Ser Arg Lys 

50 55 60 

Lys Thr val Pro Asp He Tyr He Cys Asn Leu Ala Val Ala Asp Leu 

65 70 75 80 

Val His He Val Gly Met Pro Phe Leu He His Gin Trp Ala Ar9 Gly 



val Asp Arg Tyr Phe Ala Leu Val Gin Pro Phe Arg Leu Thr Arg Trp 



Val Pro Lya Gin Arg Val Met Lys Leu Thr Lys Met Val LM Val Leu 
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<210> 41 

<211> 24 

<212> OTA 

<213> Artificial 



<220> 

<221> aloe f«itun 
<223> BoraT Sequence 



«210> 42 

<211> 24 

<212> OTA 

<213» Artificial 



<210> 43 

<211> 20 

<212> OTA 

<213> Artificial 



<220> 

<221> nlsc lutwt 
<223> HoveT Sequence 



<210> 44 
<211> 24 
<212> DMA 



<213> Artificial Seqt* 
<220> 

<221> mite feature 

<223> MoreT Sequence 



<210> 45 

<211> 21 

<212> OTA 

Oi3> Artificial 



<220> 

<221> alec fMtun 
<223> MoeeT Sequence 



<210> 46 

<211> 24 

<212> OTA 

<213> Artificial Sequence 

<220> 

<221> miac faatura 

<223> »<rrel Sequence 



<210> 47 

<211> 26 

<212> OTA 

<213> Artificial 



<220> 

<221> nlsc ft»tur« 
<223> Koral Sequence 



<210> 4B 

<211> 26 

<212> OTA 

<213> Artificial 



<220> 

<221> nlsc faatura 
<223> Wor.T Sequence 



<210> 4» 

<211> 2 6 

<212> OTA 

<213> Artificial 

<220> 

<221> ease re* tor* 

<223> Mov.I a 



<210> SO 

<211> 26 

<212> OTA 

<213> Artificial 

<220> 

<221> nlsc faatora 

<223> Moral ! 



<210> 51 

<211> 23 

<212> OTA 

<213> Artificial 



<220> 

<221> aiac feature 
<223> Store! Sequence 



<210> 52 

<2U> 23 

<212> OTA 

<213>- Artificial 



<220> 

<221> aiac raatura 
<223> Horel Sequence 



<210> 53 

<211> 31 

<212> OTA 

<213> Artificial 
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<223> Hovel Sequence 



<210> 54 

<211> 27 

<212> OTA 

<213> Artificial Sequence 

<220> 

<221> ad.se faatura 

<223> BoveT a 



<210> 55 

<2ll> 26 

<212> OTA 

<213> Artificial 



<220> 

<221> nice faatura 
<223> WoreT Saquanca 



<210> 36 

<211> 26 

<212> OTA 

<213> Artificial Saquanca 
<220> 

<221> aiac faatura 

<223> HoraT Saquanca 



<210> 57 

<211> 23 

<213> Artificial Saquanca 
<220> 

<221> raise faatura 

<223> Bora I Saquanca 



Fay* 42 



P»g« 43 



<2i2> osa 

<2ll> Artificial Saquanca 

<220> 

<221> nlac ftitur* 

<223> Hovel J 

«oa> sa 



FCT/USOQ/315OT 



ggaggttgcg ecttagegae agatgacc 



<213> Artificial ! 



<211> 26 

<212> DBA 

<213> Artificial Saquai 
<2J0» 

<22l> adsc fioturt 

<223> NovaT Saquaaca 



<210> 60 

<211> 25 

<212> DMA 

<213> Artificial t 



<210> 61 

<211> 2B 

<212> OKA 

<213> Artificial 

<220> 

<221> *i»c faatura 

<223> Ncn " 



<210> 62 

<211> 29 

<212> DMA 

<213> Artificial 

<220> 

<221> miac featara 

<223> NovaT S 



<210> 64 

<211> 25 

<212> DMA 

<213> Artificial Sequenca 
<220> 

<221> »i»c faatura 

<223> Moval Saquanca 



<210> €5 

<211> 26 

<212> DMA 

<213> Artificial Sequence 

<220> 

<221> miac faatura 

<22 3> Moval 3aqu«nc* 



<210> 66 

<211> 26 

<212> DMA 

<213> Artificial 



«220> 

<221> miac faatura 
<223> NovaT saquanca 



<210> 67 

<211> 23 

<212> DMA 

<213> Artificial 
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<211> 23 
<212> DMA 
<213> Artificial 



ggagacaacc atgaatgagc cac 



<210> 69 

<211> 24 

<212> DMA 

<213> Artificial Saquanca 
<220> 

<221> aiac faatura 

<223> Not 



<210> 69 

<211> 21 

<212> DMA 

<213> Artificial Saquanca 



<400> 69 

ggcaccagtg gaggttttct gagcatg 

<210> 70 

<211> 27 

<212> DMA 

<213> Artificial 



<220> 

<221> miac faatura 
<223> HovaT Saquanci 



<210> 73 

<211> 27 

<212> DMA 

<213> Artificial 



<220> 

<221> miac faatura 
<223> NovaT Saquanca 



<40Q> 73 

ctgcgatgcc cacactcaat acttctg 

<210> 74 

<211> 27 

<212> DMA 

<213> Artificial Saquaaca 

<220> 

<221> miac faatura 

<223> NovaT S 



<210> 75 

<211> 22 

<212> DMA 

<213> Artificial Saquanca 



<210> 71 

<211> 23 

«212> DMA 

<213> Artificial 



<220> 

<221> nlac faatura 
<223> Nov«T Saquanca 



<210> 76 

<2U> 2< 

<212> OHA 

<213> Artificial 



<220> 

<221> miac faatura 
<223> NovaT Saquanca 



<210> 72 



Paga 46 



<210> 77 



Paga 47 



<211> 24 

<212> DMA 

<213> Artificial 



» agc***ogtg gtcctcctoo 



<210> 78 

<211> 27 

<212> DRA 

<213> Artificial 



<220> 

<221> aisc feature 
<223> Hovel Sequence 



<210> 82 

<211> 30 

<212> DBA 

<213> Artificial 



<220> 

<221> aisc feature 
<223> Novel Sequence 



<210> A3 

<211> 1356 

<212> DMA 

<213> Bcrao sapiens 



pct/usooaism 



teg* a ctctagg 



<210> 79 

<211> 20 

<212> DMA 

<213> Artificial 



<220> 

<221> misc feature 
<223> Hovel Sequence 



<210> AO 

<211> 25 

<212> UNA 

<213> Artificial 

<220» 

<221> ad.sc feature 

<223> Novel « 



<210> 81 

<211> 30 

<212> OKA 

<213> Artificial 



<220> 

<221> Miac feature 
<223> Wov.T Sequence 



: cacccatccc ccagtcatca gggaactctt ccactttggg gagggtccct 
j gtccctctac tgccagtggg gtcccggagg tggggctacg ggatgttgct 



tcggaatctg tggecetctt cttcatgctc ctgctggact tgactgctgt ggctggcaat 

cttcgtcttc 
catgctctcc 
ctacttgttt 
tgtggagcgc 
gctggtggcc 
egtgttggga 
ccagtggagc 



cacctctgcc tggtggacct gctggctgcc etgaccctca' 
agctctgccc tctttgacca cgccctcttt ggygaggtgg 
ctgagcgtgt gctttgtcag cctggccatc ctctcggtgt 
tactattaeg tagtccaccc eatgcgctac gaggtgcgc* 
tctgtgctgg tgggtgtgtg ggtgaaggcc ttggccatgg 
agggtctcct gggaggaagg agctcccagt gtccccccag 
cacagtgcct actgccagct ttttgtggtg gtctttgctg 
ctgctcetca tacttgtggt ctactgcagc atgttccgag 
cagcacgggc cgctgcccac gtggatggag acaccecgge aacgctccg* atctctcagc 



cctgccgcct 
cagccatcaa 
tgacgctcgg 
cttctgtgce 
gctgttcact 
tectttactt 



gggggaggga aagcaaaggt ggttcteetg gctgtggggg gacagttcct gctctgttgg 
ttgccctact tctctttcca cctctatgtt gccctgagtg ctcagcecat ttcaeetggg 
caggtggaga gtgtggtcac ctggattggc tacttttgct tcacttccaa ccctttcttc 
tatggatgtc tcaaccggca gatccggggg gagctcagca agcagtttgt ctgcttcttc 
aagccagctc cagaggagga gctgaggctg cctagccggg egggctccet tgaggagaac 
ttcctgcagt tecttcaggg gactggctgt ccttctgagt cctgggtttc ccgaccccta 



120 
180 
240 
300 
360 
420 
«B0 
1*0 
600 
660 
720 
790 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
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gagacctctg agttcctgga 
ctccgtcctg ccgcctcacc 

<210> 84 

<211> 451 

<212> PRT 

<213> Homo aapleaa 

<400> 84 

Hat Glu Ser 5er Pro V 



gcagcaactc accagcgaca tcatcatgtc agacagctac 1320 

1356 



Gly Arg Val Pro Gin 
20 

Clu Val Cly Leu Arg 
35 

Met Leu Leu Leu Asp 

50 

Ala val lie Ala lys 

65 

Hla Leu Cys Leu Val 
B5 

Ala Mat Leu Ser Ser Ser J 

100 

Val Ala Cya Arg Leu Tyr 1 
115 

Ala lie Leu Ser Val Ser I 



Pro Cln Ser 

Pro Gly Pro 

Val Ala 8er 
40 

Thr Ala Val 

55 

Pro Ala Leu 
Leu Leu Ala 



Ser Val Leu Val CI; 



Glu Val Arg 
Trp val Lya 



Pro Val Leu Cly Arg Val ! 



a Cln Trp Ser Hi. 



Ser Gly Asn 
10 

Ser Thr Ala 

Clu Ser Val 

Ala Gly Asi> 
60 

Arg Lya Phe 

75 

Ala Leu Thr 
90 

Asp His Ala 

Ser Val Cya 

Val Glu Arg 
140 

Met Thr Leu 

155 

Ala Leu Ala 

170 

Glu Gly Ala I 
Ser Ala Tyr < 



Ser Ser 

Gly 
30 

Ala Leu 
45 

Ala Ala 
Val Vtm 
Leu Met 

Leu Phe 

110 

Phe Val E 

125 

Tyr Tyr 1 
Cly Leu V 
Met Ala S 



Ser Gly Val Pro 



val Met 

val Phe 
80 

Pro Leu 

95 

Cly Clu 



Val val Val Phe Ala Val. Leu Tyr Phe I 
210 215 

: Met Phe Arg \ 



i Pro Thr Trp Met < 



> Leu Leu 
1 Val Ala 

i Arg Gin 



ST 



Gin Thr Thr Pro Bis 



265 

Thr Phe Cly 



Val Thr Ser S 
cly Cly Lya * 



f Ala Pro 

> • 

i Val Val 
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I Ala Leu Set 

) 

L Thr Trp lie 



315 

Cly Tyr 
330 

i Arg Gin 

5 

i Pro Ala I 

l Glu Glu J 

i Pro Ser Glu Ser Trp * 
I 395 

i Pro Pro Ala Val Aap I 
410 

Gin He Ala Glu Clu Thr Ser Glu Phe Leu Clu ( 



Leu Glu Ser 



Pro He ! 
Phe Cys I 
He Arg ( 



j Glu Glu Leu 

i 

I Leu Gin Pha 
c Arg Pro Leu 



t Ser Pro Arg 



<210> B5 

<211> 28 

<212> DMA 

<213> Homo sapiena 



<210> 86 

<211> 29 

<212> DMA 

<213> Bono sapiena 



<210> 87 

<211> 1041 

<212> PHA 

<213> Mono sapiens 



ggcaccttcs gcaataaca* 
ttcccaattg tatatetgat 



cttgcae.cc*. tccatctccg tatcagaaat ggaaccaaat 
cagcaggeac tgcacaattg aeaacttcea gagagaattt 
aatatttttc tggggagtet tgggaaetgg gttgtccata 
Page 51 



120 
180 



WO 010*471 



teeetcT"/ 240 
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tatgttttcc tgcagectta teagaagtcc ecatctgtga acgttttcat gcteaeti 

gccatttcag atetcctgtt cataagcacg ettcecttei gggctgact* ttatcttaga 300 

ggctccaatt'ggatatttgg agacctggcc tgcaggatta tgtcttatte cttgtatgtc 360 

aacatgtaca gcagtattta ttteetgacc gtgctgagtg ttgtgegttt ectggcaatg 420 

gttcacccct ttcggcttct gcatgtcacc agcatcagga gtgcctggat ectctgtggg 490 

atcatatgga tccttatcat ggcttcctca ataatgctcc tggacagtgg ctctgagcag 540 

aaeggcagtg tcacatcatg cttagagetg aatctctata aaattgetaa gctgcagacc 600 

acgaactata ttgccttggt ggtgggctgc ctgctgccat ttttcacact cagcatctgt 660 

tatctgctg* tcattcgggt tctgttaaaa gtggaggtcc cagaatcggg gctgcgggtt 720 

xatcatc atcaccttga tcatcttett cttgtgtttc 780 

:»ctg»g gaccgtccac ttgacgacat ggaaagtggg tttetgcaaa 640 ' 

gacagactgc ataaagctxc ggttateaca ctggccttgg cagcagccaa tgectgcttc 900 

aatcctctgc tetattactt tgctggggag aattttaagg ecagactaaa gtctgcactc 960 . 

agaaaaggcc atccacagaa ggcaaagaca aagtgtgttt tccctgttag tgtgtggttg 1020 

1041 



<210> 83 

<211> 34 6 

<212> PUT 

<213> Booo aapian* 



Hat Gin Arg Lys Pha Hat Sar Leu sin Fro Ser lla Sar v*l Sar Glu 
1 5 10 15 

Hat Glu Pro Asn «ly Thr Pba Sax AsnAsn Asn Sar Arg Asn Cya Thr 

20 25 30 

lla Glu Mb B» ly» Arg Glu Pha Pba Pro lla V*l Tyr Lao 21* lla 
35 40 45 

Pha Pha Trp Cly Val Lau Gly Asn Cly Lao Sar lla Tyr Val Pha Lau 
50 55 CO 

Gin Pro Tyr Lys Lya Sar Thr Sar Val Asa Val Pha Hat Lao Asa Lau 
65 70 75 B0 

Ala lla Sar Aap Lau Lau Pha lla Sar Thr Lao Pro Pha Arg J 



90 



95 



i Arg Sar Ala Trp II* Lau Cys Cly 
155 160 

' Xla Hat Lau Lau Aap Sar 

175 

Cy» Lau Glu Lau Asn Leu 
190 

Tyr lla Ala Lau val v»l 

- y 

r Lau Lau lla 



170 
r Sar 



r Lau Sar lla c 

: 

> Val I 
r Zla 3 



»ro Glu Sar Cly 1 
235 

tie lla Thr Lau 1 

!50 

-eu Arg Thr Val n 

urg Lao HI a Lya 
295 

at Cya Pba Asn 



Ala Lau Val 
Pro Lae Leu 



. Lys J 
i Lya 1 



: Lys 



Lau Lys J 
Cya Val I 



<210> 99 

<211> 29 

<212> DMA 

<213> Artificial 



<220> 

<221> xaisc faetur* 
<223> Novel Sequence 



<210> 90 

<211> 29 

<212> DMA 

<213> Artificial 

<220> 

<22i> niic faatura 

<223> Mo»-T Sequenci 



<210> 91 

<211> 1527 

<212> OKA 

<213> Hono aapian* 



ctctccaaaa tgcccatcag 
ttcctcgccg ectetttcgt 
cagctgctgc aggtgaccaa 
atttcgctcg tggccccctg 
agccacttct gcacggccct 
accattgtcg tggtgtcagt 



cagcacgcgc 
cctggcccae 
cggcaacata 
ccgttttatc 
ggtggtggcc 
ggttagcctc 



ggcatcatcc gctcaaccgt gctggttatc 
gtgetggcgc tagtgttgca gcgcasgccg 
tttaacctcc tcgtcaccga cctgctgcag 
♦cctctgtgc ctctcttctg gccccteaac 
acccacctgt tcgccttcgc cagcgtcaac 
ttgtccatca tccaccctct ctcctacccg 
ctcctctatg gcacctggat tgtggccatc 



ctctgctcca tgatctgggg 
atcgtcattc cactgattgt 
aggcagcatg ctctgctgta 
tgtgtggaga atgaggatga 
gagtttcgcc gccagcatga 



ctacggctgg 
ggccagcccc 
catgattgcc 
c«atgtc»»9 
agagggagca 

•ggtgaggtc 



agctacacta ttcteagcgt ggtgtccttc 
tgc&actccg tggtgttctg tgcagcccgg 
agacacagct tggaagtgcg agtcaaggac 
gagaagaagg aggagttcca ggatgagagt 
aaggccaagg agggcagaat ggaagccaag 
acggggacca gtgagagtag tgtagaggcc 
ecggtggcc* gcgacggcag catggagggt 



•atttcagtg aggatgacgt cgaggcagtg eacatcccgg agagcctccc acccagtcgt 
cgtaaeagca eeagcaacce tcctctgccc aggtgctacc agtgcaaagc taagaaagtg 
atcttcatca tcattttctc ctatgtgcta tccctggggc ectaetgctt tttagcagtc 
ctggccgtgt gggtggatgt cgaaacccag gtaccccagt gggtgatcac cataatcatc 
tggcttttct tcctgcagtg ctgcatecac ccctatgtct atggctacat gcacaagace 
itccagga catgctgaag aagttcttct gcaaggaaa* geecccgaa* 



tcctacgatt ctgctacttt tccttga 



<210> 92 

<211> 509 

<212> PUT 

<213> homo aapian* 



120 
190 
240 
300 
360 
4 20 
4 90 
SAO 
600 



960 
1020 
1O80 
1140 
1200 
1260 
1320 
1380 
1440 
1S00 
1527 



PCT/US0O/3J5O9 



<c40C> 92 

Hat Thr Sar Thr Cya Thr Aan Sar Thr Arg Glu 5a r Aan Sar Ser Hi a 



10 



15 



Thr Cya Hat Pro Leu Sar Lya Hat Pro lla Sar Lau Ala Hla Gly lla 

20 25 30 

lla Arg Sar Thr val Lau Val lla Ph. Lau Ala Ala Sar Pha Val Gly 
35 40 45 

Aan lla Val Lau Ala Lau v*l Lau Gin Arg Lya Pro Cln Lao Lau Gin 
50 55 60 

val Thr Aan Arg Pha lla Pba Asn Lau Lao Val Thr Aap Lau Lao Gin 

65 . 70 75 90 

lla Sar Lau Val Ala Pro Trp Val Val Ala Thr Sar Val Pro Leu Pba 



Lya Glu Oly Arg Hat Glu Ala 



3! 



Aap Cly Sar Lau 



SB' 



» Fro 1 

p Tyr C 



410 
I Asp Val 



> Lau Gl y 

r Glu Asp 

t Arg Xr g J 
380 

l Lys Ala I 

k 

rUoClyl 
Glo Thr C 



Glu Asp Asp 

350 

Asp Vol Glu 



Gla Vol Pro 
430 

Uu Rm Fh* 1m Glo Cya Cy» 
445 

Rls Lya Thr Zla Lya Lya Gin 
. 460 

cya Lya Glo Lye Pro Pro Lys 
475 480 

Thr Glo Gly Gly Thr Gill Gly 



<210> 93 

<211> 25 

<212> DMA 

<213> Artificial Stqmnc* 
<220> 

<221> aiiac futon 

<223> Nova! Sequence 



<210> 94 

<211> 29 

<212> DMA 

<213> Artificial Sequence 



<210> 95 
<211> 1092 
<212> UNA 



WO 01/36471 



<211> ROM 
<400> 95 



Ala Val Ala Leu Lou Sor Asn Ala Lao Val Leu Lou Cya Cya Ala Tyr 

20 25 30 

sor Ala Glu Lou Arg Thr Arc Ala Ser Gly Val Lou Lam Val Aon Lou 
35 40 45 

Sor Lou Gly Bio Lou Leu Lou Ala Ala Lou Asp Mat Pro Pbo Thr Loo 
50 55 60 

Leu Gly Val Mot Arg Gly Arg Thr Pro Sor Alo Pro Gly Ala Cya Glo 
65 10 75 80 

vol Ho Gly Pho Lou Asp Thr Pha Lou Ala Sor Aan Ala Alo Loo Sor 

Pay* 57 



PCTAISO0/31S09 



etgggccccg 


gcgaggcgct 


getggegggt 


ctcctggtgo tggtoctggc 


cgtggcgctg 


60 


ctetccaacg 


cactggtgct 


gctttgttgc 


gcctacagcg 


etgegctccg cactcgagcc 


120 


tcaggcgtcc 


tcctggtgaa 


tctgtcgctg 


ggccacctgc tgctggcggc gctggocatg 


ieo 


cccttcacgc 


tgctcggtgt 


gatgcgcggg 


cggacoccgt 


eggcgcccgg 


cgcotgccao 


240 


gtcattggct 


tcctggaeac 


cttcctggcg 


t ceo » eg egg 


egctgagcgt 


ggcggcgctg 


300 


egcgcogacc 


ogtggctggc 


egtgggettc 


ccoctgcgct 


•cgccggecg 


cctgcgeccg 


360 


cgctatgccg 
gcacttggct 


gcctgctgct 
gctcgcggct 
ctgogcgtoe 


gggctgtgcc 
tggctocagc 
gcgcttcgco 


tggggaeogt 
ogcgccttcg 
gcettcoecg 


cgctggcctt 
egtcctgttc 
ccacgctcco 


etcoggeget 
getgegectg 
tgccgtgggc 


420 
«80 
540 


ttcgtgctgc 


cgctggcggt 


gctctgcctc 


acctcgctcc 


oggtgcoccg 


ggtggcacgc 


600 


egccectgce 


egcgcatgge 


ceccgtcacc 


atgaaggege 


tcgcgctgct 


cgccgacctg 


660 


coccccagtg 


tgcggcogcg 


ctgcctcatc 


cagcagoogc 


ggcgccgcca 


ccgcgccacc 


720 


aggaagattg 


gcactgctat 


tgcgaccttc 


ctcatetget 


ttgccccgto 


tgtcatgacc 


780 


oggctggcgg 


agctcgtgcc 


cttcgtcacc 


gtgaaegece 


ogaagggeot 


cctcogcaag 


S40 


tgcctgacct 


acogcaaggc 


ggtogccgac 


ccgttcocgt 


octctctgct 


ccgccggccg 


900 


ttcegccoog 




catggtgcac cggctgctga 


egagaacccc 


gcgcccogca 


960 


tccacceatg 


ocogctctct 
egtceaccca 


ggatgtggcc 
caacggctct 


ggcotggtgc 
gtggacoeog 


accogctgct 
ogaatgattc 


gaegagaacc 

ctgcctgcog 


1020 
1080 


cagacacoet 


oa 










1092 


<210> 96 
<211> 363 
<212> PRT 
<213> Horn 


> sapiens 












<4 00> 96 














Mot Gly Pr< 
1 


> Gly Glu A 
S 


a Leo Lou J 


kla Gly Lou 
10 


Lou val Mot 


Val Lou 

15 





PCT/US00O1509 



<210> 98 

<211> 36 

<212> OKA 

<213> Artificial 



<220> 

<221> nioc feature 
<223> Novel Soquanca 



Ala Gin Lys Gly He Lou Ser Lya Cya Leu Thr Tyr Ser Lyo Ala Val 



Lou Lys Arg Thr Pro Arg Pro Ala Sor Thr Mia Asn Gly sor Vol Asp 
340 345 350 

Thr Glu Aon Asp Ser Cys Loo Clo Glo Thr Bis 
355 360 

<210> 97 

<211> 34 

<212> DMA 

<213> Artificial Sequence 
<220> 

<22l> ad.se feature 

<223> SoWI Soquonce 

<400> 97 

. gatctctago atggagtcct cacccatccc ccag 



gotcgatatc cgtgoctcca gccggggtga ggegge 

<210> 99 
<211> 2610 
<212> DMA 

<213> Hotao sapiens and Rat 

<4 00> 99 

atggagtcct cacccatccc ccagtcatca gggaactctt ccaetttggg gogggtccct 
caoaceccog gtccctctac tgccagtggg gtcceggogg tggggetoeg ggatgttgct 
teggaatctg tggccctctt cttcatgctc ctgctggact tgactgctgt ggetggcoat 
gccgctgtgo tggccgtgat cgccoagacg cctgccctcc goooatttgt cttcgtcttc 
cacctctgcc tggtggrfcct getggctgee ctgaccctca tgcccctggc catgctctcc 
agctctgecc tctttgacca cgccctettt ggggaggtgg cctgccgcct ctacttgttt 
ctgagcgtgt gctttgtcag cctggccotc ctctcggtgt cogccotcaa tgtggogcgc 
tactattacg tagtccaccc cotgogctac gaggtgegco tgacgctggg gctggtggcc 
tctgtgctgg tgggtgtgtg ggtgaaggcc ttggccatgg cttctgtgcc ogtgttggga 
agggtctcct gggaggaegg ogctcceogt gtccccccag getgttcact ccagtggagc 
cacegtgect octgccagct ttttgtggtg gtctttgctg tcctttoctt tetgttgcco 
ctgctcctca tocttgtggt etaetgeoge atyttccgag tggeecgcgt ggctgccotg 
cagcacgggc cgctgcccoc gtggatggog acaccccggc oocgctccga otetctcoge 



120 

1B0 
240 
300 
360 
420 
400 
540. 
600 
660 



gggggaggg* aagcagcegt ggttctcctg gctgtggggg gacagttcct gctctgttgg BOO 

ttgccctact tctctttcca cctctatgtt gecetgagtg ctcageccat ttcaaetggg 960 

caggtggegs gtgtggtcac ctggattggc tocttttget tcacttccoa ccctttcttc 1020 

tatggatgtc tcsaccggco gatceggggg gogctcogca agcagtttgt ctgettettc 1090 

aagccagctc cagaggagga gctgaggctg cctagccggg agggctccet tgaggagooc 1140 

ttcctgcogt tecttcaggg goctggctgt ccttctgagt cctgggtttc ccgaccccta 1200 

cccogcccco ageaggagee occtgccgtt goctttcgoo teceaggeca gotagctgag 1260 

gagacctctg agttcctgga geagcaoetc occegcgaco tcotcotgtc ogacogctac 1320 

Pago 39 



ctcegtcctg 


ccgcctcscc ccgqctggag 


tcagcgatat 


ctgcagaatt ccaeeacact 


1380 




atccgagctc ggteccaage 


ttgggctgea 


ggtcgatggg ctgectcggc 


1440 




ccgeggacce gcgcaacgag 


gegeeggcgc 


agcgegaggc caacaaaaag 


1500 


a tcgagaegc 


agctgcagae ggecsagcsg 


gtctaccggg 


ccacgcaccg cctgctgctg 


1560 


ctgggtgctg 


gegagtctgg caaaagcacc 


attgtgaegc 


agatgaggat cctacaeget 


1*20 


aatgggttte 


acooegeggg cggcgeageg 




ctgcaaggag eaacagcget 


1680 


ggtgegaagg 


ccaccaaagt gcaggacatc 


aaaaacaacc 


tgeaggaggc cattgaaacc 


1740 


attgtggccg 
ttcagagtgg 


ccatgagcaa cctggtgccc 
actecattct gagcgtgatg 


cccgtggagc 
aacgtgccea 


tggcceaccc tgegeaceag 
•ctttgactt cccacctgaa 


1800 
I860 


ttctetgagc 


atgccaagge tctgtgggag 


gatgagggag 


ttcgtgcctg ctacgagcgc 


1820 


tccaacgagt 


accagctgat cgactgtgcc 


cagtacttcc 


tggacaagat tgatgtgatc 


1*80 


m gca ggcco 


aetacgtgcc asgtgacceg 


gacctgctte gctgcegcgt cctgacctct 


2040 


agaetct ttg 


agaccaegtt ceaggtggac 


aaagtcaact 


tecaeatgtt cgatgtgggc 


2100 


ggccegcgcg 


atgaacgccg caegtggatc 


cagtgcttca atgatgtgac tgccatcatc 


2160 


ttcgtggtgg 


ccagcagcag ctacaacatg 


gtcatceggg 


aggacaaeca gaccaaccgt 


2230 


ctgcaggagg 


ctctgaacct cttcaagagc 


atctggaaca 


acegatggct gcgtaccatc 


2280 


tctgtgatcc 


eetteetcM caagcaagat 


ctgcttgctg 


ageaggtcct cgctgggaaa 


2340 


tcgaagattg 


aggactactt tccagagttc 


gctcgctaca 


ccactcctga ggatgcgact 


2400 




gagaggacce acgcgtgacc 


eggqcceagt 


ecttcstecg ggatgagttt 


24 60 


ctgageatca 


gcactgctag tggagatgga 


cgtcectact 


gctaccctca ctttacetgc 


2520 


gccgtggeca 


ctgegaecat ecgccgtgtc 


ttceacgact 


gccgtgacat catccagcgc 


25B0 


atgcatcttc 


gccaatacga gctgctctaa 






261 D 



<210> 100 

<211> 868 

<212> PHT 

<213> Hobo 

<400> 100 

Hat Glu Sox Sar Pro Ila Pro Gin Ser sar Gly Aan Sac Sar Thr Lao 
1 5 10 15 

Gly Arg Val Pro Gin Thr Pro Gly Pro Sar Thr Ala Sar Gly Val Pro 
20 25 30 

Glu val Gly Leo Arg Asp Val Ala Sar Gin Sar Val Ala Lao Pha Pba 
35 40 45 

Mat Lao In Leu Asp Leu Thr Ala val Ala Gly Asa Ala Ala Val Mat 



50 



35 



PCT/US00OISW 



1 Ila Ala Lya Thr Pro 
70 

i Cys Lao Val Asp Lau 



Ala Leo Arg 
Lao Ala Ala 



lys Pha Val Pha Val pha 
?i 60 

Lau Thr Leo net Pro Lou 
85 

Gly Glo 



» Cys 

115 



Arg Lao Tyr Lao Pha Loo Sar * 

120 

i Ila Am Val * 



Val Bla I 
145 

Sar Val I 
Pro val I 
Pro Gly t 



i Val Arg Mat Thr Lao Gly Lao \ 



» Tyr Pha Lau 

S 

: Pha Arg Val 



175 
t Val Pro 



Lau Pro Lao Lau Lao Ila 



i Thr Pro Arg Gin 
t Thr Sar Sar Gl; 



a* 



305 

Gin Val 



Glu Sar Val Val Thr 1 



Sar Lya Gin Pha Val Cys Pha I 



Pro Sar Pro Lya Gin Glo Pro ! 



WO 01/3*471 



PCTAJSOHOISW 



PCT/US0Q/31S09 



Pro Sor Sar Val Pro Sar Lau Gly Cya Arg Sar Mat Gly Cya Lau Gly 
465 470 475 480 

A»n Sar Lys Thr Glu Asp Gin Arg Asn Glu Glu Lya Ala Gin Arg Glu 
485 490 495 

Ala Asn Lys Lys 11a Glu Lys Gin Leu Gin Lys Asp Lys Gin Val Tyr 

500 505 510 



'IS 



Pro His Pha Thr Cys Ala Val Asp Thr Glu Asn Ila Arg. 



Arg Val Pha Aan Asp Cys Arg Asp Ila Ila Gin Arg Mat His Lau Arg 



Gly Glo Lys Ala Thr Lys Val Gin Asp Ila Lya Asn Asn Lau Lys Glu 

575 



<210> 101 

<211> 30 

<212> CKA 

<213> Artificial 



<220> 

<221> od.SC feature 
<2 23> hoy el Sequence 



Sar Aan Glu Tyr Gin Leu lie Asp Cys Ala Gin Tyr Pha Leu Asp Lys 



<210> 102 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> atiac feature 

<223> MoreT Sequence 



<210> 103 

<211> 2781 

<212> DNA 

<213> Hobo Sapiana and Aat 



Asn Asa Arg Trp Leu Arg Thr He Ber Val He Lau Pha Leu Asn Lys 
Page 62 



ctctccaaaa tgcccatcag cctggcccac ggcatcatce gctcaaccgt gctggttatc 120 

ttcctcgccg cctctttcgt eggcaacata gtgctggcgc tagtgttgca gcgcaagccg 180 

cagctgctgc eggtgeccee ccgttttate tttaacctcc tcgtceccga ectgetgeag 240 
Page 63 



•tttcgctcg 
agctacttct 
•ccsttgtcg 



ctctgetcca 
•tcgtcsttc 



tggccccctg ggtggtggcc ecctctgtgc ctctcttctg gccecteaae 
gcacggccct ggttagcctc aeccaeetgt tcyecttcae cagcgtcaec 
tggtgtc*gt ggatcgetac ttgtccatca tccaccctct ctccucccj 
cccagcgccg cggttacctg ctcctctatg gcacctgget tgtggccatc 
ctcctecsct ctaeggetgg ggceaggetg ectttgatga gcgeaatget 
tgatcteggg ggccagcccc egctaceeta ttctcagcgt ggtgtccttc 
cactgattgt catgattgcc tgctactccg tggtgttctg tgcsgcccgg 
ctctgctgta caatgtcaag agacscagct tggaagtgcg agtcaaggac 



tgtgtggaga 
gagtttcgec 



aggggcagcg 



tgaaggccaa ggaaggaagc acggggacca gtgagagtag tgtagaggcc 
aggaggtcag agagageage acggtggeea gcgacggcag catggagggt 



gtcaaccagt gcagcatcga ertgggtgaa gatgacatgg agtttggtga agacgacatc 
natttcogtg aggatgacgt cgaggcagtg aacatcccgg agagcctccc acccagtcgt 
cgtaacagce acagcaaccc tcctetgccc aggtgctacc agtgcaaage tgctaaagtg 
atcttcatca tcattttctc ctetgtgcta tccctggggc cctactgctt tttagcagte 
ctggccgtgt gggtggatgt . cgaaaccceg gtaccccegt gggtgatcac cetaatcatc 
tggcttttct tcctgcagtg ctgcatccac ccctatgtct atggctacat gcacaagaec 
attaagaagg aaatccagga catgctgaag aagttcttct gcaaggaaaa gcceccgaaa 
gaagatagcc ecccagacct gcccggaaca gagggtggga ctgeaggcaa gattgtccct 
tcccacgatt ctgctacttt tcctgegata Tctgcagaat tccaccacae tggactagtg 
gatccgagct cggtaccaag cttgggctgc aggtcgatgg getgectcgg eaacagtaag 
accgaggaee agcgcaacga ggagaaggcg cagegcgagg ceaacaaaaa gatcgagaag 
cagctgcaga aggacaagca ggtctaccgg gceacgceee gcctgctgct gctgggtgct 
ggagagtctg gcaaaagcac cattgtgaag cagatgagga tcctacatgt testgggttt 
aacggagagg gcggcgaaga ggacecgcag gctgcaagga gcaacagcga tggtgagaag 
gcceccaeag tgcaggacat caaaaaeaac etgaaggagg ccattgaaae cattgtggcc 
gccatgagca acctggtgcc ccccgtggag ctggccaecc ctgagaacca gttcegagtg 
gactacattc tgagcgtgat gaacgtgcca aactttgact tcccacctga attctatgag 
catgocaagg ctctgtggga ggatgaggga gttcgtgcct gctacgagcg ctccaacgeg 
teccegctga tcgactgtgc ccagtacttc etggacaaga ttgatgtgat eaagcaggce 
gactacgtgc caagtgacca ggacctgctt cgctgccgcg tcctgacctc tggaatcttt 
Pag* (4 



360 
420 
480 
540 



720 
780 
t40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1BOO 
1860 
1920 
19B0 
2040 
2100 
2160 
2220 



PCTAIS0W3I5W 



gagaccaagt tcceogtgga caaagtcaac ttccacatgt 
gatgaacgcc gcaagtggat ccagtgcttc aatgatgtga 
gccagcagca gctacaacat ggtcatccgg gaggacaaec 
gctctgaaec tcttcaagag catctggaac aacagatggc 
ctcttcctca aeaagcaaga tctgettget gagaaggtec 
gaggactact tteeagagtt cgctcgctac accactcctg 
.ggagaggacc cacgcgtgac ccgggccaag tacttcatcc 
agcactgcta gtggagatgg acgtcactac tgctaccctc 
actgagaaca tccgccgtgt cttcaacgec tgecgtgaca 
cgccaatacg agctgctcta a 

<210> 104 
<211> 926 
<212> PUT 

<213> Hobo sapiens and Rat 

<400> 104 

Mat Thr Ser Thr Cya Thr Asn Sar Thx Arg Glu 1 
15 10 

Thr Cya Hat Pro Lau 3«r Lya Hat Pro 

20 25 

Ila Arg 8ar Thr val Lau val Ila Pha Lau Ala 

35 40 

Asn Ila Val Lao Ala Laa Val Lao Gin Arg Lya 

50 55 

Val Tbr Aan Arg Pba Ila Phe Aan Lau Lau Val 

65 70 75 

Ila Sar Lau Val Ala Pro Trp Val Val Ala Thr 



ccgatgtggg eggccagcgc 
ctgccatcat ettcgtggtg 



tgcgtaccat ctctgtgate 
tcgctgggaa atcgaagatt 
aggatgcgac tcccgagccc 
gogatgagtt tctgagaatc 
aetttacctg cgccgtggae 
tcatccagcg catgcatctt 



2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2781 



Sar Lao Ala 



85 



90 



Arg Tyr Lau Sar Ila Ila Hla Pro Lau Sar Tyr 



Pro Gin 
60 

Sar val 
Val Scr 

val val 

125 

140 
: Trp Ila * 

f Gin Ala l 

t Ala Scr I 



Sar Sar Hla 

15 

His Gly Ila 
30 

Pha Val Gly 

Lau Lau Gin 

Lau Lau Gin 
■ 0 

Pro Lau Pha 
95 

Lau Thr Hla 

110 

Sar v«i Asp 
Lya Hat Tbr 



PCT/US0OO1509 



rCT/USOO/31509 



Arg Ila Leu Hi* Val Aan Gly pha Aan Gly Glu Gly Gly Glu Glu Asp 



Glu Gly Val Arg Ala Cya Tyr Glu Arg Sat Asn Glu Tyr Gin Lau Ila 



Tyr Asn Hat Val Ila Arg Glu Asp Asn Cln Thr Asn Arg Lou Gin Glu 
783 790 795 800 

) Ala Leo Asn Lau Pha Lya Sar Ila Trp Asn Asn Arg Trp Lau Arg Thr 
605 810 815 



Arg Tyr Thr Thr Pro Glu Asp Ala Thr Pro Glu Pro Gly Glu Asp Pro 

Arg Val Thr Arg Ala Lys Tyr Pha He Arg Asp Glu Pha Lao Arg Ila 

Sar Thr Ala Sar Gly Aap Gly Arg His Tyr Cys Tyr Pro His Pha Thr 

Page 67 



WO 01/36471 



<210> 105 

<211> 23 

<212> DMA 

<213> Artificial Saqu* 
<220> 

<221> nlsc feature 

<223> Novel Sequence 



<210> 106 

<211> 24 

<212> OKA 

<213> Artificial 



<220> 

<221> ml__ 

<223> Novel sequence 



<210> 107 

<21J> 25 

<212> OKA 

<213> Artificial 



<220> 

<221> nlac ftttuit 
<223> NoveT Sequence 



<210> 108 

<211> 26 

<212> DMA 

<213> Artificial 



<220> 

<221> miac feature 
<223> Novel Sequence 



PCT/US0GV3150* 



WO 01/3*471 



<211> 24 

<212> n» 

<213> Artificial Sequence 
<2J0> 

<221> Rise feature 

<223> Novel Sequence 



PCT7US0QO1509 



<220> 

<221> aiac feature 

<223> Hovel Sequence 

<400> 110 



<210> 111 

<211> 26 

<212> DMA 

<213> Artficial 



<220> 

<221> aisc feature 
<223> nova! Sequence 



<210> 112 

<211> 25 

<212» OKA 

<213> Artificial 



<220> 

<22i> miac feature 
<223> HoTal Sequence 



<210> 113 

<211> 24 

<212> DRA 

<213> Artificial 



PCT/USO0/3150* 



WO 01/36471 



PCT/US00/315O9 



<210> 114 

<au> 2? 

<212> cm 

<213> Artificial Sequence 
<220> 

<221> miac feature 

<223> Hovel Sequence 



<210> 115 

<211> 26 

<2I2> OKA 

<213> Artificial 



<220> 

<221> mlac feature 
<223> MoveT Sequence 



ctctgatgcc atetgetooe ttceto 

<210> 116 

<211> 26 

<212> DMA . 

<213> Artificial 



<220> 

<221> mlac faatura 
<223> Novel i 



<210> 117 

<211> 24 

<212> CHA 

<2»3> Artificial 



<220> 

<221> mlac faatura 

<223> Novel Sequence 

<4QO> 111 

tggtgycgat agcceacegc gctc 

<2io> no 

«211> 24 

<212> DRA 

<213> Artificial Sequence 



<220> 

<221> mi ac_ feature 
<223> Novel Sequence 



<210> 119 

<211> 23 

<212> DMA 

<213> Artificial 



<220> 

<221> nlac faatura 
<223> Novel Sequence 



<210> 120 

«211> 23 

<212> OKA 

<213> Artificial 

<220> 

<221> mlac faatura 

<223> Hoi ~ 



<210> 121 

<211> 24 

<212> OKA 

<213> Artificial 

<220> 

<221> mlac faatura 

<223> Novel £ 



<210> 122 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> nine faatura 

<223> Novel i 



Page 71 



<210> 123 

<J1J> 24 

<212> OK* 

<213> Articlal Sequence 
<220> 

<221» Mine f«tur. 

<223> NoveT J 



<400> 123 

gcyctgagcg cngaccagtg gctg 

<210> 124 

<211> 24 

<212> DSA 

<213> Artificial Sequence 
<220> 

<221> stisc feature 
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FURTHER INFORMATION CONTINUED FROM PCT7ISV 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-4 

G protein-coupled receptor as characterised by SE0.ID.2, Its 
non-endogenous, const Itutlvely activated version, a cDNA 
encoding said receptor as characterized by SE0.ID.1, a 
plasmld comprising said cDNA, and a host cell comprising 
said plasmld. 

2. Claims: 5-8 

6 protein-coupled receptor as characterized by SCO. ID. 4, its 
non-endogenous, constltutl vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID.3. a 
plasmld comprising said cDNA, and a host cell comprising 
said plasmld. 

3. Claims: 9-12 

6 protein-coupled receptor as characterized by SEO.ID.6, Us 
non-endogenous, constltutl vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID.5, a 
plasnld comprising said cDNA, and a host cell comprising 
said plasmld. 

4. Claims: 13-16 

& protein-coupled receptor as characterized by SE0.ID.8. Its 
non-endogenous, constltutl vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID.7, a 
plasmld comprising said cDNA, end a host cell comprising 
said plasmld. 

5. Claims: 17-20 

6 protein-coupled receptor as characterized by SEO.ID.10, 
Its non-endogenous, constltutl vely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID.9. a 
plasmld comprising said cDNA, and a host cell comprising 
said plasmld. 

6. Claims: 21-24 

6 protein-coupled receptor as characterized by SEQ.ID.12, 
Its non-endogenous, constltutl vely activated version SEQ 
ID. 84, a cDNA encoding said receptor as characterized by 
SEO. ID.ll. a plasmld comprising said cDNA, and a host cell 
comprising said plasmld. 
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7. Claims: 25-28 



6 protein-coupled receptor as characterized by SEQ. 10. 14, 
Its non-endogenous, constltutlvely activated version SEO.ID. 
88, a cONA encoding said receptor as characterized by 
SEQ. ID. 13. a plasald comprising said cDNA, and a host eel) 
comprising said plasald. 



8. Claims: 29-32 



G protein-coupled receptor as characterized by SEQ. ID. 16. 
its non-endogenous, constltutlvely activated version 
SEQ. ID. 92. a cOMA encoding said receptor as characterized by 
SEQ. ID. 15. a plasald comprising said cOHA, and a host cell 
comprising said plasmld. 



9. Claims: 33-36 



6 protein-coupled receptor as characterized by SEQ. ID. 18. 
its non-endogenous, constltutlvely activated version, a d 
encoding said receptor as characterized by SEO.ID. 17, a 
plasmld comprising said cDMA, and a host cell comprising 
said plasmld. 



10. Claims: 37-40 



6 protein-coupled receptor as characterized by SEQ. ID. 20, 
Its non-endogenous, constltutlvely activated version, a cDNA 
encoding said receptor as characterized by SEQ. ID. 19, a 
plasmld comprising said cDNA, and a host cell comprising 
said plasmld. 



11. Claims: 41-44 



6 protein-coupled receptor as characterized by SEQ. ID. 22. 
Its non-endogenous, constltutlvely activated version, a cDNA 
encoding said receptor as characterized by SEQ. ID. 21. a 
plasmld comprising said cDNA, and a host cell comprising 
said plasmld. 



12. Claims: 45-48 



fi protein-coupled receptor as characterized by SEQ. ID. 24, 
Its non-endogenous, constltutlvely activated version, a cDI 
encoding said receptor as characterized by SEQ. ID. 23, a 
plasmld comprising said cDNA. and a host cell comprising 
said plasmld. 
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13. Claims: 49-52 

6 protein-coupled receptor as characterized by SEQ ID 26 
Its non-endogenous, constltutlvely activated version, a cOHA 
encoding said receptor as characterized by SEQ. ID. 25, a 
plasmld comprising said cDNA, and a host cell comprising 
said plasmld. 

14. Claims: 53-56 

6 protein-coupled receptor as characterized by SEQ. ID. 28. 
Its non-endogenous, constltutlvely activated version, a cDNA 
encoding said receptor as characterized by SEQ. ID. 27, a 
Plasmld comprising said cOKA. and a host cell comprising 
said plasald. 

15. Claims: 57-60 

6 protein-coupled receptor as characterized by SEQ. ID. 30, 
Its non-endogenous, constltutlvely activated version, a cOHA 
encoding said receptor as characterized by SEQ. ID. 29, a 
plasald comprising -said cDNA, and a host cell comprising 
said plasald. 

16. Claims: 61-64 

fi protein-coupled receptor as characterized by SEQ. ID. 32, 
Its non-endogenous, constltutlvely activated version, a cDKA 
encoding said receptor as characterized by SEO.ID. 96, a 
plasald comprising said SEO.ID 95, and a host cell 
comprising said plasmld. 

17. Claims: 65-68 

G protein-coupled receptor as characterized by SEQ. 10. 34. 
Us non-endogenous, constltutlvely activated version, a cOHA 
encoding said receptor as characterized by SEO.ID. 33, a 
plasald comprising said cDNA. and a host cell comprising 
said plasald. ■ . 

18. Claims: 69-72 

6 protein-coupled receptor as characterized by SEQ. ID. 36, 
Its non-endogenous, constltutlvely activated version, a cDNA 
encoding said receptor as characterized by SEQ.ID.35, a 
plasmld comprising said cDNA, and a host cell cooprlslng 
said plasald. 
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19. Claims: 73-76 

G protein-coupled receptor as characterized by SEQ. ID. 38, 
its non-endogenous, constltutlvely activated version, a cDNA 
encoding said receptor as characterized by SEQ. ID. 37, a 
plasmld comprising said cDNA. and a host cell comprising 
said plasmld. 

20. Claims: 77-80 

G protein-coupled receptor as characterized by SEQ ID 40 
Its non-endogenous, constltutlvely activated version, a cDNA 
encoding said receptor as characterized by SEQ. ID. 39, a 
plasmld comprising said cDNA, and a host cell comprising 
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